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cDNA ENCODING A TRAG GENE (TGF- p RESISTANCE ASSOCIATED GENE) 

AND ITS PROTEIN PRODUCT 



This application is being filed as a PCT International Patent application on 
5 February 12, 2001, designating all countries, in the name of The Government of the 
United States, represented by the Secretary, Department of Health and Human Services 
(applicant for all countries except the U.S.), and in the names of Snorri S. Thorgeirsson, 
a U.S. citizen, and Sean Sanders, a South African citizen (applicants for U.S. only). 

10 FIELD OF THE INVENTION 

The present invention relates generally to the isolation and characterization of 
novel DNA and polypeptides, designated herein as "TRAG." 

BACKGROUND OF THE INVENTION 

1 5 The ligand transforming growth factor-beta (TGF-p) has a wide range of 

physiological and pathological effects in epithelial cells (which includes liver cells), 
including inhibition of cell proliferation, stimulation of cell differentiation, matrix 
production, and apoptosis. (J. MassaguS, Annual Rev. Cell Biol, 6:597 (1990)). TGF-p 
can suppress uncontrolled cell proliferation and induce cell death (apoptosis) in 

20 damaged/redundant cells by signaling through membrane receptors (TGF-p receptor type 
I, TpRI; TGF-p receptor type II, TpRII) and intracellular signal transduction proteins 
(for example, SMAD2, SMAD3, and SMAD4). (M. Kawabata et al., J. Biochem.. 125:9 
(1999)). More importantly, many tumor types, and particularly those in the liver, have 
been shown to lose sensitivity to TGF-p. That is, these cells are not susceptible to 

25 abrogation of cell division or death induced by TGF-p. This can occur in a number of 
different ways, including through loss of one of the two TGF-p receptors or through 
disruption of the intracellular signaling pathway (such as loss of mutation of one of the 
SMAD proteins). (See, for example, K.R. Cho et al., J. Biol. Chem. Suppl.. 137 (1992) 
and A. Kiss et al., Clin. Cancer Res.. 3:1059 (1997)). 

30 Recently, a number of new proteins that interact with one or more components of 

the TGF-p signaling pathway have been discovered. These include TRIP-1, which has 
been shown to associate with, and be biochemically modified by, TpRII, and STRAP, 
' which can interact with both TpRI and TpRII. (R.H. Chen et al., Nature. 377:548 (1995) 
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and P.K. Datta et aL J. Biol. Chem.. 273(52):34671 (1998)). Both of these proteins 
J contain conserved domains known as WD repeats. These repeats are -40 amino acids in 

length and follow a loosely conserved consensus sequence. (E.J. Neer et al., Nature, 
371 :297 (1994)). WD repeat-containing proteins are believed to interact through these 
5 conserved domains and are involved in a plethora of cellular and molecular pathways 
including signal transduction, gene regulation, protein trafficking, and RNA processing. 
(EJ. Neer et al., Nature . 371 :297 (1994)). 

It is desirable to identify and . characterize new cellular genes encoding proteins 
involved in TGF-p signaling. The identification and characterization of new genes 
10 encoding proteins involved in TGF-p signaling are particularly useful for better 
understanding TGF-p signaling pathways such as, for example, useful for studying 
mechanisms of cell proliferation and the means to modulate such activity, and 
particularly useful for screening targets for cancer therapy. 

15 SUMMARY OF THE INVENTION 

Applicants have identified nucleotide sequences that encode a novel polypeptide, 
designated in the present application as "TRAG" (TGF-p Resistance Associated Gene), 
which exhibits a number of characteristics that make it a useful tool for studying cell- 
cycle cpntrol and oncogenesis. TRAG is a previously undescribed gene. TRAG has 

20 multiple WD repeat elements and two tyrosine phosphorylation motifs. WD repeat 
elements are known to be involved in protein-protein interactions, which control many 
aspects of cell growth, differentiation, and survival. Tyrosine phosphorylation motifs can 
play a role in the modulation of protein function. As such, this novel protein has a variety 
of applications in the identification, characterization, and regulation of activities 

25 associated with cellular regulation as well as processes associated with oncogenesis. It is 
believed that TRAG plays an important role in TGF-p signal transduction, normal cell 
function, and organogenesis. 

In one embodiment, the invention provides an isolated nucleic acid molecule that 
includes nucleotides that encode a TRAG polypeptide. For example, the isolated nucleic 

30 acid can include DNA encoding a TRAG polypeptide having amino acid residues 1 to 

1488 of Table 1, amino acid residues 1 to 1490 of Table 2, or amino acid residues 1 to 

1489 of Table 3 or can include DNA complementary to such an encoding nucleic acid 
sequence of Table 1, 2, or 3, which remains stably bound to it under at least moderate, 



2 



WO 01/66739 PCT/US01/04475 

and optionally, under high stringency conditions. In another embodiment, the invention 
provides a vector comprising a gene encoding a TRAG polypeptide. A host cell 
comprising such a vector is also provided. By way of example, the host cells may be £. 
coli, yeast, insect, fungal, or mammalian cells. A process for producing TRAG 
5 polypeptides is further provided and comprises culturing host cells under conditions 
suitable for expression of TRAG. If desired, the TRAG polypeptide can be recovered. 

In one embodiment, the invention provides an isolated polypeptide comprising a 
TRAG fragment, wherein the TRAG fragment comprises multiple WD repeat elements 
and two tyrosine phosphorylation motifs. In a favored embodiment, the isolated 

10 polypeptide exhibits TRAG-like activity and, typically, is capable of interacting with 
other WD repeat element-containing proteins. In another embodiment, the invention 
provides isolated TRAG polypeptide. In particular, the invention provides isolated 
native sequence TRAG polypeptide, which in one embodiment, includes an amino acid 
sequence comprising residues 1 to 1488 of Table 1, an amino acid sequence comprising 

15 residues 1 to 1490 of Table 2, or an amino acid sequence comprising residues 1 to 1489 
of Table 3. In a related embodiment, the invention provides chimeric molecules 
comprising TRAG polypeptide fused to a heterologous polypeptide or amino acid 
sequence. An example of such a chimeric molecule is a factor that includes a TRAG 
fused to a protein such as the maltose-binding protein. In yet another embodiment, the 

20 invention provides a polypeptide capable of specifically binding a TRAG polypeptide 
such as an antibody specific for a TRAG polypeptide. 

In other embodiments, the invention provides methods for using TRAG- 
RELATED polypeptides and nucleic acids for studying and modulating mechanisms 
involved in cellular proliferation. In one embodiment, the invention provides a method 

25 of modulating cellular phenotype by controlling the level of TRAG expression within the 
cell. For example, mammalian cells can be transfected with a DNA vector encoding a 
TRAG polypeptide having the amino acid sequence of Table 1, 2, or 3. The TRAG 
polypeptide can be expressed in the cells, and cells having an altered phenotype such as, 
for example, cancer cells or cells having aggressive metastasis, can then be selected. 

30 Alternatively, the invention provides a method of reducing TRAG expression via 

antisense oligonucleotides to effect a cellular phenotype such as TGF-p sensitivity. In a 
related embodiment, the invention provides methods for effecting the interaction between 
TRAG and other WD repeat element-containing proteins. 

3 
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BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1A shows the normal expression pattern of TRAG in various rat tissues 
according to a Northern blot analysis. This Northern blot shows mRNA (total and poly- 
5 A" 1 ) extracted from 3 rat cell lines (RLE, B5T, and C4T). 

Figure IB shows a graphical representation of the data from Figure 1 A after the 
levels of TRAG mRNA were quantified. Data was quantified using a densimetric scan 
of the phosphoimager plate (to which the radioactive membrane was exposed) by making 
use of ImageQuant*™ Version 3.3 software (Molecular Dynamics). 

10 Figure 2A shows the genetic localization of the TRAG gene to chromosome 1 8 

in mouse (region is 1 8D. 1-E.3). This localization was determined by using a DNA 
probe complementary to a portion of the TRAG gene, covering exons 14 to 21 (left 
panel). The right panel shows a similar experiment, but performed using a DNA probe 
that reacts exclusively with the entire mouse chromosome 18. 

15 Figure 2B shows the genetic localization of the TRAG gene to chromosome 1 8 

in human (region is 18q 21.1-22). This localization was determined by using a ~5kb 
DNA probe against a centrally located region of the TRAG gene spanning exons 12 to 14 
(left panel). The right panel shows a similar experiment, but performed using a DNA 
probe that reacts exclusively with the entire human chromosome 18. 

20 Figure 3 A shows the size and qualitative levels of TRAG mRNA expressed in 

normal rat tissue from heart, brain, spleen, lung, liver, muscle, kidney, and testis. mRNA 
transcripts having sizes of approximately 7.2 kb and 3.5 kb were identified. The brain 
tissue shows a smear of multiple mRNAs, which can be the result of alternative splicing 
of a single mRNA transcript. 

25 Figure 3B shows a dot blot hybridization using a commercial membrane 

containing rat mRNA from brain, heart, gastrointestinal tract (GIT), numerous internal 
organs including liver and testis (indicated), tumor cell lines (labeled "lines"), and fetal 
tissue (from top: brain, heart, kidney, liver, spleen, thymus, and lung). TRAG mRNA 
was expressed in all tissues but appeared to be most abundant in brain. 

30 Figure 3C shows a dot blot hybridization using a commercial membrane 

containing mouse mRNA from (top, L to R) brain, eye, liver, lung, kidney, heart, skeletal 
muscle, smooth muscle, pancreas, thyroid, thymus, submaxillary gland, spleen, testis, 
ovary, prostate, epididymus, uterus, and embryo (7 days, 11 days, 15 days, and 17 days). 
TRAG expression was identified in all tissues. 
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Figure 4 shows a Western blot of TRAG protein extracted from three rat cell 
lines (B5T, C4T, and RLE phi 13). This blot shows elevated levels of TRAG protein in 
transformed B5T and C4T cells relative to untransformed RLE cells. 

Figure 5 A shows a Western blot of TRAG protein extracted from chemically 
5 transformed rat cell lines (AFL-B8 and AFL-D8) and virally transformed rat cell lines 
(361 1T2, 361 1T5, and J2-14). This blot also includes samples from RLE and B5T cells 
as negative and positive controls, respectively. A Northern blot, which shows the 
expression of TRAG mRNA in these cell lines, is also shown. 

Figure 5B shows a Western blot of TRAG protein extracted from tumor cell lines 
10 derived from double transgenic c-myc/TGF-a mice. A Northern blot, which shows the 
TRAG mRNA levels in these cell lines, is also shown. 

Figure 5C shows a Western blot of TRAG protein extracted from Alex, Chang, 
FOCUS, HepG2, Huh-7, Sk-Hep-1, WRL-68, and HeLa human tumor cell lines. A 
Northern blot, which shows the TRAG mRNA levels in these cell lines, is also shown. 
15 Figure 5D shows a Western blot of TRAG protein extracted from primary tumors 

taken from double transgenic c-rnyc/TGF-a mice. This blot also includes samples from 
RLE and B5T cells as negative and positive controls, respectively. 

Figure 6 A shows confocal microscope images of RLE and B5T cells transfected 
with either a control green fluorescent protein (GFP only) plasmid or with a vector 
20 containing the TRAG gene fused to the green fluorescent protein (TRAGr-GFP). The 
images show that the TRAG protein is localized to the cytoplasm of both RLE and B5T 
cells. 

Figure 6B shows an immunohistochemical stain of paraformaldehy de-fixed RLE 
and B5T cells. The TRAG protein was stained by using an anti-TRAG polyclonal 
25 antibody. The stain shows that the TRAG protein is localized to the cytoplasm of both 
RLE and B5T cells and appears to be perinuclear. 

DETAILED DESCRIPTION OF THE INVENTION 
I. Definitions 

30 The terms "TRAG polypeptide" and "TRAG 11 when used herein encompass native 

sequence TRAG and TRAG variants (which are further defined herein). TRAG may be 
isolated from a variety of sources, such as from human tissue types or from another 
source, or prepared by recombinant or synthetic methods. 
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The TRAG polypeptide, which may be a fragment of a native sequence, contains 
multiple WD repeat elements and 2 tyrosine phosphorylation motifs. 

A "native sequence TRAG n is a polypeptide having the same amino acid 
sequence as TRAG derived from nature. Such native sequence TRAG can be isolated 
5 from nature or can be produced by recombinant or synthetic means. The term "native 
sequence TRAG" specifically encompasses naturally-occurring variant forms (e.g., 
alternatively spliced forms) and naturally-occurring allelic variants of the TRAG. In one 
embodiment of the invention, the native sequence TRAG is a mature or full-length 
native sequence TRAG polypeptide comprising amino acids 1 to 1488 of Table 1. 

10 Alternately, the TRAG polypeptide comprises amino acid residues 1 to 1490 of Table 2 
or amino acid residues 1 to 1489 of Table 3. 

"TRAG variant" means a functionally active TRAG as defined below having at 
least about 80% amino acid sequence identity with TRAG, such as the TRAG 
polypeptide having the deduced amino acid sequence shown in Tables 1, 2, or 3 for a 

15 full-length native sequence TRAG. Such TRAG variants include, for instance, TRAG 
polypeptides wherein one or more amino acid residues are added to or deleted from the 
N-terminus or C-terminus of the sequence of Tables 1, 2, or 3. Ordinarily, a TRAG 
variant will have at least about 80% or 85% amino acid sequence identity with native 
TRAG sequences, more preferably at least about 90% amino acid sequence identity, 

20 Even more preferably a TRAG variant will have at least about 95% amino acid sequence 
identity with the native TRAG sequence of Tables 1, 2, or 3. As noted above, TRAG 
variants include multiple WD repeat elements and 2 tyrosine phosphorylation motifs. 
Functionally active TRAG variants typically have at least about 50 amino acid residues 
and preferably at least about 100 amino acid residues. 

25 "Percent (%) amino acid sequence identity" with respect to the TRAG sequences 

identified herein is defined as the percentage of amino acid residues in a candidate 
sequence that are identical with the amino acid residues in the TRAG sequence after 
aligning the sequences in the same reading frame and introducing gaps, if necessary, to 
achieve the maximum percent sequence identity, and not considering any conservative 

30 substitutions as part of the sequence identity. Alignment for purposes of determining 
percent amino acid sequence identity can be achieved in various ways that are within the 
skill in the art, for instance, using publicly available computer software such as BLAST 
software. Those skilled in the art can determine appropriate parameters for measuring 
alignment, including any algorithms needed to achieve maximal alignment over the full 
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length of the sequence being compared. 

"Percent (%) nucleic acid sequence identity" with respect to the TRAG sequences 
identified herein is defined as the percentage of nucleotides in a candidate sequence that 
are identical with the nucleotides in the TRAG sequence, after aligning the sequences 
5 and introducing gaps, if necessary, to achieve the maximum percent sequence identity. 
Alignment for purposes of determining percent nucleic acid sequence identity can be 
achieved in various ways that are within the skill in the art, for instance, using publicly 
available computer software. Those skilled in the art can determine appropriate 
parameters for measuring alignment, including any algorithms needed to achieve 

10 maximal alignment over the full length of the sequences being compared. 

The term "epitope tagged" when used herein refers to a chimeric polypeptide 
comprising TRAG, or a functional fragment thereof, fused to a "tag polypeptide." The 
tag polypeptide has enough residues to provide an epitope against which an antibody can 
be made, or which can be identified by some other agent, yet is short enough that it does 

15 not interfere with the activity of TRAG. The tag polypeptide preferably also is 

sufficiently unique so that the antibody does not substantially cross-react with other 
epitopes. Suitable tag polypeptides generally have at least six amino acid residues and 
usually between about 8 and about 50 amino acid residues (preferably, between about 10 
and about 20 residues). 

20 "Isolated," when used to describe the various polypeptides disclosed herein, 

means polypeptide that has been identified and separated and/or recovered from a 
component of its natural environment. Contaminant components of its natural 
environment are materials that would typically interfere with diagnostic or therapeutic 
uses for the polypeptide, and may include enzymes, hormones, and other proteinaceous 

25 or nonproteinaceous solutes. In preferred embodiments, the polypeptide will be purified 
to a degree sufficient to obtain N-terminal or internal amino acid sequence by use of a 
spinning cup sequenator or to homogeneity by SDS-PAGE under nonreducing or 
reducing conditions using Coomassie blue or silver stain. Isolated polypeptide includes 
polypeptide in situ within recombinant cells because at least one component of the 

30 TRAG natural environment will not be present. Ordinarily, however, isolated 

polypeptide will be prepared by at least one purification step (referred to herein as an 
"isolated and purified polypeptide"). 

An "isolated" TRAG nucleic acid molecule is a nucleic acid molecule that is 
identified and separated from at least one contaminant nucleic acid molecule with which 



7 



WO 01/66739 PCT/US01/04475 

• 4. 

is ordinarily associated in the natural source of the TRAG nucleic acid. An isolated 
TRAG nucleic acid molecule is other than in the form or setting in which it is found in 
nature. Isolated TRAG nucleic acid molecules therefore are distinguished from the 
TRAG nucleic acid molecule as it exists in natural cells. However, an isolated TRAG 
5 nucleic acid molecule includes TRAG nucleic acid molecules contained in cells that 
ordinarily express TRAG where, for example, the nucleic acid molecule is in a 
chromosomal location different from that of natural cells. 

The term "control sequences" refers to DNA sequences necessary for the 
expression of an operably linked coding sequence in a particular host organism. The 

ID control sequences that are suitable for prokaryotes, for example, include a promoter, 
optionally an operator sequence, and a ribosome binding site. Eukaryotic cells are 
known to utilize promoters, polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional relationship 
with another nucleic acid sequence. For example, DNA for a presequence or secretory 

1 5 leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein that 
participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding 
site is operably linked to a coding sequence if it is positioned so as to facilitate 
translation. Generally, "operably linked" means that the DNA sequences being linked are 

20 contiguous, and, in the case of a secretory leader, contiguous and in reading phase. 
However, enhancers do not have to be contiguous. Linking may be accomplished by 
ligation at convenient restriction sites. If such sites do not exist, the synthetic 
oligonucleotide adaptors or linkers may be used in accordance with conventional 
practice. 

25 "Polynucleotide" and "nucleic acid" refer to single- or double-stranded molecules, 

which may be DNA, comprised of the nucleotide bases A, T, C and G, or RNA, 
comprised of the bases A, U (substitutes for T) , C, and G. The polynucleotide may 
represent a coding strand or its complement. Polynucleotide molecules may be identical 
in sequence to the sequence that is naturally occurring or may include alternative codons 

30 that encode the same amino acid as that which is found in the naturally occurring 
sequence (See Lewin, Genes V , Oxford University Press, Chapter 7, pp. 171-174 
(1994)). Furthermore, polynucleotide molecules may include codons that represent 
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conservative substitutions of amino acids as described. The polynucleotide may 
represent genomic DNA or cDNA. 

"Polypeptide" refers to a molecule comprised of amino acids that correspond to 
those encoded by a polynucleotide sequence that is naturally occurring. The polypeptide 
5 may include conservative substitutions where the naturally occurring amino acid is 
replaced by one having similar properties, where such conservative substitutions do not 
alter the function of the polypeptide (See Lewin, Genes V , Oxford University Press, 
Chapter 1, pp. 9-13(1994)). 

The term "antibody" is used in the broadest sense and specifically covers single 
10 anti-TRAG monoclonal antibodies (including agonist, antagonist, and neutralizing 
antibodies) and anti-TRAG antibody compositions with polyepitopic specificity. The 
term "monoclonal antibody" as used herein refers to an antibody obtained from a 
population of substantially homogeneous antibodies, i.e., the individual antibodies 
comprising the population are identical except for possible naturally-occurring mutations 
15 that may be present in minor amounts. 

The term "mammal" as used herein refers to any mammal classified as a mammal, 
including humans, cows, horses, dogs, and cats. In a preferred embodiment of the 
invention, the mammal is a human. 

20 II. Compositions and Methods of the Invention 

A. TRAG Nucleic Acids and Polypeptides 

The present invention provides newly identified and isolated nucleotide 
sequences encoding polypeptides referred to in the present application as TRAG. In 
particular, Applicants have identified and isolated genes and cDNA encoding TRAG 

25 polypeptides, as disclosed in further detail in the Examples below. Using sequence 
homology searches, Applicants found that TRAG (as shown in Tables 1, 2, and 3) 
contains multiple WD repeat elements, which are found in other proteins such as TRIP-1 
and STRAP, known to interact with proteins in the TGF-p signaling pathway. Two 
tyrosine phosphorylation motifs were also identified. These motifs can play a role in the 

30 modulation of protein function through addition or removal of phosphate molecules at 
the tyrosine residue. 

In addition to the fulMength native sequence TRAG and soluble forms of TRAG 
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described herein, it is contemplated that TRAG variants can be prepared. TRAG variants 
can be prepared by introducing appropriate nucleotide changes into the TRAG nucleotide 
sequence, or by synthesis of the desired TRAG polypeptide. Those skilled in the art will 
appreciate that amino acid changes may alter post-translational processes of the TRAG, 

5 such as changing the number or position of glycosylation sites or altering the protein 
binding characteristics. Variations in the native full-length sequence TRAG or in 
various domains of the TRAG described herein, can be made, for example, using any of 
the techniques and guidelines for conservative and nonconservative mutations set forth, 
for instance, in U.S. Patent No. 5,364,934. For example, amino acid substitutions within 

10 the tyrosine phosphorylation motifs are contemplated, such as conservative substitutions 
at one or both of these residues. 

Variations may be a substitution, deletion, or insertion of one or more codons 
encoding the TRAG that results in a change in the amino acid sequence of the TRAG as 
compared with the native sequence TRAG. Optionally the variation is by substitution of 

15 at least one amino acid with any other amino acid in one or more of the domains of the 
TRAG. Guidance in determining which amino acid residue may be inserted, substituted, 
or deleted without adversely affecting the desired activity may be found by comparing the 
sequence of the TRAG with that of homologous known protein molecules and 
minimizing the number of amino acid sequence changes made in regions of high 

20 homology. Amino acid substitutions can be the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, such as the 
replacement of a leucine with a serine, i.e., conservative amino acid replacements. 
Insertions or deletions may optionally be in the range of 1 to 5 amino acids. The 
variation allowed may be determined by systematically making insertions, deletions or 

25 substitutions of amino acids in the sequence and testing the resulting variants for activity 
in any of the in vitro assays described in the Examples below. 

It is a well-established principle of protein chemistry that certain amino acid 
substitutions, entitled "conservative amino acid substitutions," can frequently be made in a 
protein without altering either the conformation or the function of the protein. Such 

30 changes include substituting any of isoleucine (T), valine (V), and leucine (L) for any other 
of these hydrophobic amino acids; aspartic acid (D) for glutamic acid (E) and vice versa; 
glutamine (Q) for asparagine (N) and vice versa; and serine (S) for threonine (T) and vice 
versa. Other substitutions can also be considered conservative, depending on the 
environment of the particular amino acid and its role in the three-dimensional structure of 
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the protein. For example, glycine (G) and alanine (A) can frequently be interchangeable, as 
can alanine and valine (V). 

Methionine (M), which is relatively hydrophobic, can frequently be interchanged 
with leucine and isoleucine, and sometimes with valine. Lysine (K) and arginine (R) are 
5 frequently interchangeable in locations in which the significant feature of the amino acid 
residue is its charge and the differing pK's of these two amino acid residues are not 
significant. Still other changes can be considered "conservative" in particular 
environments. 

Variations can be made using methods known in the art such as site-directed 

10 mutagenesis, alanine scanning, and PCR mutagenesis. Site-directed mutagenesis (Carter 
et al., Nucl. Acids Res., 13:4331 (1986); Zoller et al., Nucl. Acids Res. . 10:6487 (1987)), 
cassette (Wells et al, Gene, 34:315 (1985)), restriction selection mutagenesis (Wells et 
al., Philos. Trans. R. Soc. London SerA . 317:415 (1986)) or other known techniques can 
be performed on the cloned DNA to produce the TRAG variant DNA. Scanning amino 

15 acid analysis can also be employed to identify one or more amino acids along a 

contiguous sequence. Among the preferred scanning amino acids are relatively small, 
neutral amino acids. Such amino acids include alanine, glycine, serine, and cysteine. 
Alanine is typically a preferred scanning amino acid among this group because it 
eliminates the side-chain beyond the beta-carbon and is less likely to alter the main- 

20 chain conformation of the variant. Alanine is also typically preferred because it is the 
most common amino acid. Further, it is frequently found in both buried and exposed 
positions (Creighton, The Proteins. (W.H. Freeman & Co., N.Y.); Chothia, J. Mol. Biol.. 
150:1 (1976)). If alanine substitution does not yield adequate amounts of variant, an 
isoteric amino acid can be used. 

25 As discussed above, redundancy in the genetic code permits variation in TRAG 

gene sequences. In particular, one skilled in the art will recognize specific codon 
preferences by a specific host species and can adapt the disclosed sequence as preferred 
for a desired host. For example, preferred codon sequences typically have rare codons 
(i.e., codons having a useage frequency of less than about 20% in known sequences of 

30 the desired host) replaced with higher frequency codons. Codon preferences for a 
specific organism may be calculated, for example, by utilizing codon usage tables 
available on the INTERNET at the following address: 

http://www.dna.affrc.gojp/-nakamura/codon.html. Nucleotide sequences that have been 
optimized for a particular host species by replacing any codons having a useage 
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frequency of less than about 20% are referred to herein as "codon optimized sequences." 



Additional sequence modifications are known to enhance protein expression in a 
cellular host. These include elimination of sequences encoding spurious polyadenylation 
5 signals, exon/intron splice site signals, transposon-like repeats, and/or other such well- 
characterized sequences that may be deleterious to gene expression. The GC content of 
the sequence may be adjusted to levels average for a given cellular host, as calculated by 
reference to known genes expressed in the host cell. Where possible, the sequence may 
also be modified to avoid predicted hairpin secondary mRNA structures. Other useful 
modifications include the addition of a translational initiation consensus sequence at the 
start of the open reading frame, as described in Kozak, Mol. Cell Biol.. 9:5073-5080 v 
(1989). Nucleotide sequences that have been optimized for expression in a given host 
species by elimination of spurious polyadenylation sequences, elimination of exon/intron 
splicing signals, elimination of transposon-like repeats, and/or optimization of GC 
content in addition to codon optimization are referred to herein as an "expression 
enhanced sequence." 

B. Modifications of TRAG 

Covalent modifications of TRAG are included within the scope of this invention. 
One type of covalent modification includes reacting targeted amino acid residues of the 
TRAG with an organic derivatizing agent that is capable of reacting with selected side 
chains or the N- or C- terminal residues of the TRAG. Derivatization with bifunctional 
agents is useful, for instance, for crosslinking TRAG to a water-insoluble support matrix 
or surface for use in the method for purifying anti-TRAG antibodies, and vice-versa. 
Commonly used crosslinking agents include, e.g., l,l-bis(diazoacetyl)-2-phenylethane, 
glutaraldehyde, N-hydroxysuccinimide esters, for example, esters with 4-azidosalicylic 
acid, homobifunctional imidoesters, including disuccinimidyl esters such as 3,3 ? - 
dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N-maleimido- 
1,8-octane and agents such as me1hyl-3-[(p-a2ddophenyl)dithio]propioimidate. In the 
alternative, TRAG can be joined to a detectable label such as a radioactive isotope such 
as I 125 or P 32 , an enzyme such as horseradish peroxidase or alkaline phosphatase, a 
fluorophore such as fluorescein isothiocyanate or a chromophore f Current Protocols In 
Molecular Biology . Volume 2, Units 10, 1 1 and 14, Frederick M. Ausubul et al. eds., 
1995; Molecular Cloning. A Laboratory Manual § 12, Tom Maniatis et al. eds., 2d ed. 
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1989). 

Another type of covalent modification of the TRAG polypeptide included within 
the scope of this invention comprises altering the native glycosylation pattern of the 
polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein 
5 to mean deleting one or more carbohydrate moieties found in native sequence TRAG, 
and/or adding one or more glycosylation sites that are not present in the native sequence 
TRAG. Addition of glycosylation sites to the TRAG polypeptide may be accomplished 
by altering the amino acid sequence. The alteration may be made, for example, by the 
addition of, or substitution by, one or more serine or threonine residues to the native 
10 sequence TRAG (for O-linked glycosylation sites). The TRAG amino acid sequence 
may optionally be altered through changes at the DNA level, particularly by mutating the 
DNA encoding the TRAG polypeptide at preselected bases such that codons are 
generated that will translate into the desired amino acids. Another means of increasing 
the number of carbohydrate moieties on the TRAG polypeptide is by chemical or 
15 enzymatic coupling of glycosides to the polypeptide. Such methods are described in the 
art, e.g., in WO 87/05330 published 1 1 September 1987, and in Aplin and Wriston, CRC 
Crit. Rev. Biochem. . pp. 259-306 (1981). 

The TRAG of the present invention may also be modified in a way to form a 
chimeric molecule comprising TRAG fused to another, heterologous polypeptide or 
20 amino acid sequence. In one embodiment, such a- chimeric molecule comprises a fusion 
of the TRAG with a tag polypeptide which provides an epitope to which an anti-tag 
antibody can selectively bind. The epitope tag is generally placed at the amino- or 
carboxyl- terminus of the TRAG. The presence of such epitope-tagged forms of the 
TRAG can be detected using an antibody against the tag polypeptide. Also, provision of 
the epitope tag enables the TRAG to be readily purified by affinity purification using an 
anti-tag antibody or another type of affinity matrix that binds to the epitope tag. 

In an alternative embodiment, the chimeric molecule may comprise a fusion of 
the TRAG with an immunoglobulin or a particular region of an immunoglobulin. The 
TRAG may be fused to any one of a variety of known fusion protein partners that are 
well known in the art such as maltose binding protein, LacZ, thioredoxin, or an 
immunoglobulin constant region (Current Protocols In Molecular Biology , Volume 2, 
Unit 16, Frederick M. Ausubul et al. eds., 1995; Linsley et al., J.Exp. Med. , 174:561-566 
(1 991)). In a preferred embodiment, this fusion partner is a non-TRAG binding molecule 
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so as to prevent difficulties associated with intramolecular interactions. For a bivalent 
form of the chimeric molecule, such a fusion could be to the Fc region of an IgG 
molecule. Other fusion proteins and tag polypeptides and their respective antibodies are 
well known in the art. Examples include poly-histidine (poly-his) or poly-histidine- 
5 glycine (poly-his-gly) tags; the flu HA tag polypeptide and its antibody 12CA5 (Field et 
al., Mol. Cell. Biol.. 8:2159-2165 (1988)); the c-myc tag and the 8F9, 3C7, 6E10, G4, 
B7, and 9E10 antibodies thereto (Evan et at., Molecular and Cellular Biology . 5:3610- 
3616 (1985)); and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody 
(Paborsky et al., Protein Engineering . 3(6):547-553 (1990)). Other tag polypeptides 
include the Flag-peptide (Hopp et al., BioTechnology. 6:1204-1210 (1988)); the KT3 
epitope peptide (Martin et al., Science, 255:192-194 (1992)); an I -tubulin epitope 
peptide (Skinner et al. . J. Biol. Chem.. 266:15163-15166 (1991)); and the T7 gene 10 
protein peptide tag (Lutz-Freyermuth et al., Proc. Natl. Acad . Sci. USA, 87:6393-6397 
(1990)). 

C. Preparation of TRAG 

The description below relates primarily to production of TRAG by culturing cells 
transformed or transfected with a vector containing TRAG nucleic acid. It is, of course, 
contemplated that alternative methods, which are well known in the art, may be 
employed to prepare TRAG. For instance, the TRAG sequence, or portions thereof, may 
be produced by direct peptide synthesis using solid-phase techniques (see, e.g., Stewart 
et al., Solid-Phase Peptide Synthesis. W.H. Freeman Co., San Francisco, CA (1969); 
Merrifield, J. Am. Chem. Soc 85:2149-2154 (1963)). In vitro protein synthesis may be 
performed using manual techniques or by automation. Automated synthesis may be 
accomplished, for instance, using an Applied Biosystems Peptide Synthesizer (Foster 
City, CA) using manufacturers instructions. Various portions of the TRAG may be 
chemically synthesized separately and combined using chemical or enzymatic methods to 
produce the full-length TRAG. 

1. Isolation of DNA Encoding TRAG 

Due to the large size of TRAG mRNA, and therefore cDNA, both the rat and 
mouse TRAG genes (coding regions only) were isolated using the RT-PCR and RACE- 
PCR techniques, as is well-known in the art. Primer oligonucleotides were synthesized 
(Gibco, BRL Life Technologies) and used to amplify cDNA (produced directly from rat 
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or mouse mRNA), as described in Example 1 . Three portions of the mouse TRAG gene 
so amplified were joined through the use of restriction digestion and subcloning, as 
described in Example 8, to yield a full length gene of 4,467 nucleotides of coding 
sequence. 

5 Alternately, DNA encoding TRAG may also be obtained from a cDNA library 

prepared from tissue expressing a TRAG mRNA. Accordingly, human TRAG DNA can 
be conveniently obtained from a cDNA library prepared from human tissue. The 
TRAG-encoding gene may also be obtained from a genomic library or by 
oligonucleotide synthesis. Libraries can be screened with probes (such as antibodies to 

0 the TRAG or oligonucleotides of at least about 20-80 bases) designed to identify the 
gene of interest or the protein encoded by it. Illustrative libraries include mouse kidney 
cDNA library (mouse kidney S'-stretch cDNA, Clonetech laboratories, Inc.) and human 
liver cDNA library (human liver 5 1 stretch plus cDNA, Clonetech Laboratories, Inc.). 
Screening the cDNA or genomic library with the selected probe may be conducted using 

5 standard procedures, such as described in Sambrook et al., Molecular Cloning: A 
Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989). An 
alternative means to isolate the gene encoding TRAG is to use PCR methodology 
(Sambrook et aL, supra : Dieffenbach et al., PCR Primer: A Laboratory Manual (Cold 
Spring Harbor Laboratory Press, 1995)). 

o For cDNA library screening, oligonucleotide sequences selected as probes should 

be of sufficient length and sufficiently unambiguous that false positives are minimized. 
The oligonucleotide is preferably labeled such that it can be detected upon hybridization 
to DNA in the library being screened. Methods of labeling are well known in the art and 
include the use of radiolabels like 32 P~labeled ATP, biotinylation, or enzyme labeling. 

5 Hybridization conditions, including moderate stringency and high stringency, are 
provided in Sambrook et al., supra . 

Sequences identified in such library screening methods can be compared and 
aligned to other known sequences deposited and available in public databases such as 
GenBank or other private sequence databases. Sequence identity (at either the amino 

0 acid or nucleotide level) within defined regions of the molecule or across the full-length 
sequence can be determined through sequence alignment using computer software 
programs which employ various algorithms to measure homology. 

Nucleic acid having protein coding sequence may be obtained by screening 
selected cDNA or genomic libraries using the deduced amino acid sequence disclosed 
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herein for the first time, and, if necessary, using conventional primer extension 
procedures as described in Sambrook et al., supra, to detect precursors and processing 
intermediates of mRNA that may not have been reverse-transcribed into cDNA. 

5 2. Selection and Transformation of Host Cells 

Host cells are transfected or transformed with expression or cloning vectors 
described herein for TRAG production and cultured in conventional nutrient media 
modified as appropriate for inducing promoters, selecting transformants, or amplifying 
the genes encoding the desired sequences. The culture conditions, such as media, 
10 temperature, pH, and the like, can be selected by the skilled artisan without undue 
experimentation. In general, principles, protocols, and practical techniques for 
maximizing the productivity of cell cultures can be found in Mammalian Cell 
Biotechnology: a Practical Approach M. Butler, ed. (IRL Press, 1991) and Sambrook et 
al,, supra . 

15 Methods of transfection are known to the ordinarily skilled artisan, for example 

by using lipofectin, CaPC>4, or electroporation. Depending on the host cell used, 
transformation is performed using standard techniques appropriate to such cells. The 
calcium treatment employing calcium chloride, as described in Sambrook et al., supra, or 
electroporation is generally used for prokaryotes or other cells that contain substantial 

20 cell-wall barriers. Infection with Agrobacterium tumefaciens is used for transformation 
of certain plant cells, as described by Shaw et al, Gene. 23:315 (1983) and WO 
89/05859 published 29 June 1989. For mammalian cells without such cell walls, the 
calcium phosphate precipitation method of Graham and van der Eb, Virology. 52:456- 
457 (1978) can be employed. General aspects of mammalian cell host system 

25 transformations have been described in U.S. Patent No. 4,399,21 6. Transformations into 
yeast are typically carried out according to the method of VanSolingen et al., J. Bact . 
130:946 (1977) and Hsiao et al., Proc. Natl Acad. Sci. flJSAY 76:3829 (1979). 
However, other methods for introducing DNA into cells, such as by nuclear 
microinjection, electroporation, bacterial protoplast fusion with intact cells, or 

30 poly cations, e.g. , polybrene, polyornithine, may also be used. For various techniques for 
transforming mammalian cells, see Keown et al, Methods in Enzymology. 185:527-537 
(1990) and Mansour et al., Nature . 336:348-352 (1988). 

Suitable host cells for cloning or expressing the DNA in the vectors herein 
include prokaryote, yeast, or higher eukaryote cells. Suitable prokaryotes include but are 



16 



WO 01/66739 



PCT/US01/04475 



not limited to eubacteria, such as Gram-negative or Gram-positive organisms, for 
example, Enterobacteriaceae such as E. coli. Various E. coli strains are publicly 
available, such as E. coli K12 strain MM294 (ATCC 3 1,446); E. coli X1776 (ATCC 
31,537); K coli strain W3 110 (ATCC 27,325) and K5 772 (ATCC 53,635) 
5 Suitable host cells for the expression of glycosylated TRAG are derived from 

multicellular organisms. Examples of invertebrate cells include insect ceils such as 
Drosophila S2 and Spodoptera Sf9 cells. See e.g. Current Protocols In Molecular 
Biology, Volume I, Unit 16, Frederick M. Ausubul et al. eds., 1995. Examples of useful 
mammalian host cell lines include rat liver epithelial cells, Hugget, A. C. et. aL, supra, 
Chinese hamster ovary (CHO) and COS cells. More specific examples include monkey 
kidney CV1 line transformed by SV40 (COS-7, ATCC CRL 1651); human embryonic 
kidney line (293 or 293 cells subcloned for growth in suspension culture, Graham et al., 
J. Gen Virol. , 36:59 (1977)); Chinese hamster ovary cells/-DHFR (CHO, Urlaub and 
Chasin, Proc. Natl. Acad. Sci. USA , 77:4216 (1980)); mouse Sertoli cells (TM4, Mather, 
Biol. Reprod.. 23:243-251 (1980)); human lung cells (W138, ATCC CCL 75); human 
liver cells (Hep G2, HB 8065); and mouse mammary tumor (MMT 060562, ATCC 
CCL5 1). The selection of the appropriate host cell is deemed to be within the skill in the 
art. 

3. Selection and Use of a Replicable Vector 

The nucleic acid (e.g., cDNA or genomic DNA) encoding TRAG may be inserted 
into a replicable vector for cloning (amplification of the DNA) or for expression. 
Various vectors are publicly available. The vector may, for example, be in the form of a 
plasmid, cosmid, viral particle, or phage. The appropriate nucleic acid sequence may be 
inserted into the vector 

by a variety of procedures. In general, DNA is inserted into an appropriate restriction 
endonuclease site(s) using techniques known in the art. Vector components generally 
include, but are not limited to, one or more of a signal sequence, an origin of replication, 
one or more marker genes, an enhancer element, a promoter, and a transcription 
termination sequence. Construction of suitable vectors containing one or more of these 
components employs standard ligation techniques that are known to the skilled artisan. 

TRAG may be produced recombinantly not only directly, but also as a fusion 
polypeptide with a heterologous polypeptide, which may be a signal sequence or other 
polypeptide having a specific cleavage site at the N-terminus of the mature protein or 
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polypeptide. In general, the signal sequence may be a component of the vector, or it may 
be a part of the TRAG DNA that is inserted into the vector. The signal sequence may be 
a prokaryotic signal sequence selected, for example, from the group of the alkaline 
phosphatase, penicillinase, Ipp, or heat-stable enterotoxin II leaders. For yeast secretion 
5 the signal sequence may be, e.g., the yeast invertase leader, alpha factor leader (including 
Saccharomyces and Kluyveromyces a-factor leaders, the latter described in U.S. Patent 
No. 5,010,182), or acid phosphatase leader, the C. albicans glucoamylase leader (EP 
362,179 published 4 April 1990), or the signal described in WO 90/13646 published 15 
November 1990. In mammalian cell expression, mammalian signal sequences may be 
used to direct secretion of the protein, such as signal sequences from secreted 
polypeptides of the same or related species, as well as viral secretory leaders. 

Both expression and cloning vectors contain a nucleic acid sequence that enables 
the vector to replicate in one or more selected host cells. Such sequences are well known 
for a variety of bacteria, yeast, and viruses. The origin of replication from the plasmid 
pBR322 is suitable for most Gram-negative bacteria, the P2 plasmid origin is suitable 
for yeast, and various viral origins (S V40, polyoma, adenovirus, VS V or BPV) are useful 
for cloning vectors in mammalian cells. 

Expression and cloning vectors will typically contain a selection gene, also 
termed a selectable marker. Typical selection genes encode proteins that (a) confer 
resistance to antibiotics or other toxins, e.g., ampicillin, neomycin, methotrexate, or 
tetracycline, (b) complement auxotrophic deficiencies, or (c) supply critical nutrients not 
available from complex media, e.g., the gene encoding D-alanine racemase for Bacilli. 

An example of suitable selectable markers for mammalian cells are those that 
enable the identification of cells competent to take up the TRAG nucleic acid, such as 
Neomycin, DHFR , or thymidine kinase. An appropriate host cell when wild-type 
DHFR is employed is the CHO cell line deficient in DHFR activity, prepared and 
propagated as described by Urlaub et al, Proc. Natl. Acad. Sci. USA. 77:4216 (1980). A 
suitable selection gene for use in yeast is the trp\ gene present in the yeast plasmid YRp7 
(Stinchcomb et al., Nature. 282:39 (1979); Kingsman et al., Gene * 7:141 (1979); 
Tschemper et al., Gene. 10:157 (1980)). The trpl gene provides a selection marker for a 
mutant strain of yeast lacking the ability to grow in tryptophan, for example, ATCC No. 
44076 or PEP4-1 (Jones, Genetics , 85:12 (1977)). 

Expression and cloning vectors usually contain a promoter operably linked to the 
TRAG nucleic acid sequence to direct mRNA synthesis. Promoters recognized by a 
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variety of potential host cells are well known. Promoters suitable for use with 
prokaryotic hosts include the p-lactamase and lactose promoter systems (Chang et al., 
Nature. 275:615 (1978); Goeddel et al., Nature, 281:544 (1979)), alkaline phosphatase, a 
tryptophan (trp) promoter system (Goeddel, Nucleic Acids Res.. 8:4057 (1980); EP 
5 36,776), and hybrid promoters such as the tac promoter (deBoer et al., Proc. Natl. Acad. 
Sci. USA. 80:21-25 (1983)). Promoters for use in bacterial systems also will contain a 
Shine-Dalgarno (S.D.) sequence operably linked to the DNA encoding TRAG. 

Examples of suitable promoting sequences for use with yeast hosts include the 
promoters for 3-phosphoglycerate kinase (Hitzeman et al.. J. Biol. Chem.. 255:2073 

10 (1980)) or other glycolytic enzymes (Hess et al.. J. Adv. Enzvme Reg.. 7:149 (1968); 
Holland, Biochemistry. 17:4900 (1978)), such as enolase, glyceraldehyde-3-phosphate 
dehydrogenase, hexokinase, pyruvate decarboxylase, phosphofructokinase, glucoses- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phosphoglucose isomerase, and glucokinase. 

15 Other yeast promoters, which are inducible promoters having the additional 

advantage of transcription controlled by growth conditions, are the promoter regions for 
alcohol dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes 
associated with nitrogen metabolism, metallothionein, glyceraldehyde-3-phosphate 
dehydrogenase, and enzymes responsible for maltose and galactose utilization. Suitable 

20 vectors and promoters for use in yeast expression are further described in EP 73,657. 

TRAG transcription from vectors in mammalian host cells is controlled, for 
example, by promoters obtained from the genomes of viruses such as polyoma virus, 
fowlpox virus (UK 2,21 1,504 published 5 July 1989), adenovirus (such as Adenovirus 
2), bovine papilloma virus, avian sarcoma virus, cytomegalovirus, a retrovirus, hepatitis- 

25 B virus, and Simian Virus 40 (SV40), from heterologous mammalian promoters, e.g. , the 
actin promoter or an immunoglobulin promoter, and from heat-shock promoters, 
provided such promoters are compatible with the host cell systems. 

Transcription of a DNA encoding TRAG by higher eukaryotes may be increased 
by inserting an enhancer sequence into the vector. Enhancers are cis-acting elements of 

30 DNA, usually about from 10 to 300 bp, that act on a promoter to increase its 

transcription. Many enhancer sequences are now known from mammalian genes (globin, 
elastase, albumin, I -fetoprotein, and insulin). Typically, however, one will use an 
enhancer from a eukaryotic cell virus. Examples include the S V40 enhancer on the late 
side of the replication origin (bp 100-270), the cytomegalovirus early promoter 
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enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus 
enhancers. The enhancer may be spliced into the vector at a position 5* or 3' to the 
TRAG coding sequence but is preferably located at a site 5 1 from the promoter. 

Expression vectors used in eukaryotic host cells (yeast, fungi, insect, plant, 
5 animal, human, or nucleated cells from other multicellular organisms) will also contain 
sequences necessary for the termination of transcription and for stabilizing the mRNA. 
Such sequences are commonly available from the 5 f and, occasionally 3', untranslated 
regions of eukaryotic or viral DNAs or cDNAs. These regions contain nucleotide 
segments transcribed as polyadenylated fragments in the untranslated portion of the 
10 mRNA encoding TRAG. Still other methods, vectors, and host cells suitable for 

adaptation to the synthesis of TRAG in recombinant vertebrate cell culture are described 
in Getting etal., Nature, 293:620-625 (1981); Mantei et al., Nature, 281:40-46 (1979); 
EP 117,060; and EP 117,058. 

4. Detecting Gene Amplification/Expression 

Gene amplification and/or expression may be measured in a sample directly, for 
example, by conventional Southern blotting, Northern blotting to determine the 
transcription of mRNA (Thomas, Proc. Natl. Acad. Sci. USA. 77:5201-5205 (1980)), 
dot blotting (DNA analysis), or in situ hybridization, using an appropriately labeled 
probe, based on the sequences provided herein. Alternatively, antibodies may be 
employed that can recognize specific duplexes, including DNA duplexes, RNA duplexes, 
and DNA-RNA hybrid duplexes or DNA-protein duplexes. The antibodies in turn may 
be labeled and the assay may be carried out where the duplex is bound to a surface, so 
that upon the formation of duplex on the surface, the presence of antibody bound to the 
duplex can be detected. 

Gene expression, alternatively, may be measured by immunological methods, 
such as immunohistochemical staining of cells or tissue sections and assay of cell culture 
or body fluids, to quantitate directly the expression of gene product. Antibodies useful 
for immunohistochemical staining and/or assay of sample fluids may be either 
monoclonal or polyclonal and may be prepared in any mammal. Conveniently, the 
antibodies may be prepared against a native sequence TRAG polypeptide or against a 
synthetic peptide based on the DNA sequences provided herein or against exogenous 
sequence fused to TRAG DNA and encoding a specific antibody epitope. 
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5. Purification of Polypeptide 

Forms of TRAG may be recovered from culture medium or from host cell lysates. 
Cells employed in expression of TRAG can be disrupted by various physical or chemical 
means, such as freeze-thaw cycling, sonication, mechanical disruption, or cell lysing 
5 agents. 

It may be desired to purify TRAG from recombinant cell proteins or polypeptides. 
The following procedures are exemplary of suitable purification procedures: by 
fractionation on an ion-exchange column; ethanol precipitation; reverse phase HPLC; 
chromatography on silica or on a cation-exchange resin such as DEAE; 

10 chromatofocusing; SDS-PAGE; ammonium sulfate precipitation; gel filtration using, for 
example, Sephadex G-75; protein A Sepharose columns to remove contaminants such as 
IgG; and metal chelating columns to bind epitope-tagged forms of the TRAG. Various 
methods of protein purification may be employed and such methods are known in the art 
and described for example in Deutscher. Methods in Enzymology. 182 (1990); Scopes, 

15 Protein Purification: Principles and Practice. Springer-Veriag, New York (1982). The 
purification step(s) selected will depend, for example, on the nature of the production 
process used and the particular TRAG produced. 

D. Uses for TRAG 

20 Nucleotide sequences (or their complement) encoding TRAG have various 

applications in the art of molecular biology, including uses as hybridization probes, in 
chromosome and gene mapping, and in the generation of antisense RNA and DNA. 
TRAG nucleic acid will also be useful for the preparation of TRAG polypeptides by the 
recombinant techniques described herein. TRAG polypeptides have various applications 

25 in the art, including uses for evaluating factors that interact with and/or control TGF-p 
signaling as means for understanding both cell proliferation control and oncogenesis. 
Moreover, TRAG genes may be introduced into cells to effect mechanisms mediated by 
TGF-p as well as processes involved in oncogenesis. 



30 1 . Screening Methods Utilizing TRAG Nucleic Acids. 

The full-length native sequence TRAG (Tables 1, 2, and 3) gene, or portions 
thereof, may be used as hybridization probes for acDNA library to isolate, for instance, 
still other genes (like those encoding naturally-occurring variants of TRAG or TRAG 
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from other species) that have a desired sequence identity to the TRAG sequences 
disclosed in Tables 1, 2, or 3. Optionally, the length of the probes will be about 20 to 
about 500 bases. The hybridization probes may be derived from the nucleotide sequence 
or from genomic sequences including promoters, enhancer elements, and introns of 
5 native sequence TRAG. By way of example, a screening method will include isolating 
the coding region of the TRAG gene using the known DNA sequence to synthesize a 
selected probe of about 40 bases. Hybridization probes may be labeled by a variety of 
labels, including radionucleotides such as 32 P or 35 S, or enzymatic labels such as alkaline 
phosphatase coupled to the probe via avidin/biotin coupling systems. Labeled probes 

10 having a sequence complementary to that of the TRAG gene of the present invention can 
be used to screen libraries of human cDNA, genomic DNA, or mRNA to determine to 
which members of such libraries the probe hybridizes. Hybridization techniques are 
described in further detail in the Examples below. 

Nucleotide sequences encoding a TRAG can also be used to construct 

1 5 hybridization probes for mapping the gene that encodes that TRAG and for the genetic 
analysis of individuals with genetic disorders. The nucleotide sequences provided herein 
may be mapped to a chromosome and specific regions of a chromosome using known 
techniques, such as in situ hybridization, linkage analysis against known chromosomal 
markers, and hybridization screening with libraries. 

20 Screening assays can be designed to find lead compounds that mimic the 

biological activity of a native TRAG or a ligand or receptor for TRAG. Such screening 
assays will include assays amenable to high-throughput screening of chemical libraries, 
making them particularly suitable for identifying small molecule drug candidates. Small 
molecules contemplated include synthetic organic or inorganic compounds. The assays 

25 can be performed in a variety of formats, including protein-protein binding assays, 
biochemical screening assays, immunoassays, and cell-based assays, which are well 
characterized in the art. 

2. Modulation of TRAG protein Expression via TRAG Antisense Oligonucleotides. 
30 Antisense technology entails the administration of exogenous oligonucleotides 

that bind to a target polynucleotide located within the cells. The term "antisense 11 refers 
to the fact that such oligonucleotides are complementary to their intracellular targets, 
e.g., TRAG. (See for example, Jack Cohen, OLIGODEOXYNUCLEOTIDES, Antisense 
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Inhibitors of Gene Expression, CRC Press, 1989; and Synthesis 1:1-5 (1988)). The 
TRAG antisense oligonucleotides of the present invention include derivatives such as S- 
oligonucleotides (phosphorothioate derivatives or S-oligos, see, Jack Cohen, supra) , 
which exhibit enhanced cancer cell growth inhibitory action. 
5 S-oligos (nucleoside phosphorothioates) are isoelectronic analogs of an 

oligonucleotide (O-oligo) in which a nonbridging oxygen atom of the phosphate group is 
replaced by a sulfur atom. The S-oligos of the present invention may be prepared by 
treatment of the corresponding O-oligos with 3H-l,2-benzodithiol-3 -one- 1,1 -dioxide, 
which is a sulfur transfer reagent. See Iyer, R. P. et al, J. Org. Chem. 55:4693-4698 
(1990); and Iyer, R. P. et al., J. Am. Chem. Soc , 1 12:1253-1254 (1990), the disclosures 
of which are fully incorporated by reference herein. 

The TRAG antisense oligonucleotides of the present invention may be RNA or 
DNA which is complementary to and stably hybridizes with the first 100 N-terminal 
codons or last 100 C-terminal codons of the TRAG genome or the corresponding mRNA. 
While absolute complementarity is not required, high degrees of complementarity are 
preferred. Use of an oligonucleotide complementary to this region allows for the 
selective hybridization to TRAG mRNA and not to mRNA specifying other regulatory 
subunits of protein kinase. Preferably, the TRAG antisense oligonucleotides of the 
present invention are a 15 to 30-mer fragment of the antisense DNA molecule having a 
sequence that hybridizes to TRAG mRNA. Optionally, TRAG antisense oligonucleotide 
is a 30-mer oligonucleotide that is complementary to a region in the first 10 N-terminal 
codons and last 10 C-terminal codons of TRAG. Alternatively, the antisense molecules 
are modified to employ ribozymes in the inhibition of TRAG expression. (L.A. Couture 
& D. T. Stinchcomb, Trends Genet.. 12: 510-515 (1996)). 

In one embodiment, the TRAG antisense oligonucleotide is coadministered with 
an agent that enhances the uptake of the antisense molecule by the cells. For example, the 
TRAG antisense oligonucleotide may be combined with a lipophilic cationic compound 
that may be in the form of liposomes. The use of liposomes to introduce nucleotides into 
cells is taught, for example, in U.S. Pat. Nos. 4,897,355 and 4,394,448, the disclosures of 
which are incorporated by reference in their entirety. See also U.S. Pat. Nos. 4,235,871, 
4,23 1,877, 4,224,179, 4,753,788, 4,673,567, 4,247,41 1, and 4,814,270 for general 
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methods of preparing liposomes comprising biological materials. Alternatively, the 
TRAG antisense oligonucleotide may be combined with a lipophilic carrier such as any 
one of a number of sterols including cholesterol, cholate, and deoxycholic acid. 
In another embodiment, the TRAG antisense oligonucleotide may be 
5 coadministered with a second agent that is affected by TRAG expression. In one 
embodiment, this second agent is one or more isoforms of TGF-p. In a preferred 
embodiment, a combination of TRAG antisense oligonucleotides and TGF-pl are 
administered to cells that have reduced sensitivity to TGF-p due to TRAG 
overexpression. In this embodiment, a combination of these two molecules may be used 

10 to synergistically induce TGF-p 1 mediated apoptosis. Methods pertaining to these 

embodiments are well known in the art. (Zwicker et al., Science, 271:1595-1597 (1996); 
Field et al., Cell, 85: 549-561 (1996); Slack et al., J. Cell Bio. ,129: 779-788 (1995); 
Hiebert et al., Mol Cell Biol. , 15: 6864-6874 (1995); White, E., Genes Dev. , 10: 1-15 
(1996); Martin et al., Cell, 82:349-352 (1995)). 

15 The antisense oligonucleotides of the present invention may be prepared 

according to any of the methods that are well known to those of ordinary skill in the art. 
Preferably, the antisense oligonucleotides are prepared by solid phase synthesis. (See 
Goodchild, J., Bioconjugate Chemistry , 1:165-167 (1990)), for areview of the chemical 
synthesis of oligonucleotides. Alternatively, the antisense oligonucleotides can be 

20 obtained from a number of companies that specialize in the custom synthesis of 
oligonucleotides. 

3. Use of TRAG Nucleic Acids in the Generation of Transgenic Animals . 

Nucleic acids that encode TRAG or its modified forms can also be used to 

25 generate either transgenic animals or "knockout" animals which, in turn, are useful in the 
development and screening of therapeutically useful reagents. A transgenic animal (e.g., 
a mouse or rat) is an animal having cells that contain a transgene, which transgene was 
introduced into the animal or an ancestor of the animal at a prenatal stage, e.g., an 
embryonic stage. A transgene is a DNA that is integrated into the genome of a cell from 

30 which a transgenic animal develops. In one embodiment, cDNA encoding TRAG can be 
used to clone genomic DNA encoding TRAG in accordance with established techniques 
and the genomic sequences used to generate transgenic animals that contain cells that 
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express DNA encoding TRAG. Methods for generating transgenic animals, particularly 
animals such as mice or rats, have become conventional in the art and are described, for 
example, in U.S. Patent Nos. 4,736,866 and 4,870,009. Typically, particular cells would 
be targeted for TRAG transgene incorporation with tissue-specific enhancers. 
5 Transgenic animals that include a copy of a transgene encoding TRAG introduced into 
the germ line of the animal at an embryonic stage can be used to examine the effect of 
increased expression of DNA encoding TRAG. Such animals can be used as tester 
animals for reagents thought to confer protection from, for example, pathological 
conditions associated with its overexpression. In accordance with this facet of the 
10 invention, if an animal is treated with the reagent and a reduced incidence of the 
pathological condition is observed as compared to untreated animals bearing the 
transgene, this would indicate a potential therapeutic intervention for the pathological 
condition. 

Alternatively, nonhuman homologues of TRAG can be used to construct a TRAG 

15 knockout animal that has a defective or altered gene encoding TRAG as a result of 
homologous recombination between the endogenous gene encoding TRAG and altered 
genomic DNA encoding TRAG introduced into an embryonic cell of the animal. For 
example, cDNA encoding TRAG can be used to clone genomic DNA encoding TRAG in 
accordance with established techniques. A portion of the genomic DNA encoding 

20 TRAG can be deleted or replaced with another gene, such as a gene encoding a selectable 
marker that can be used to monitor integration. Typically, several kilobases of unaltered 
flanking DNA (both at the 5' and 3' ends) are included in the vector (see e.g., Thomas and 
Capecchi, Cell, 51:503 (1987) for a description of homologous recombination vectors). 
The vector is introduced into an embryonic stem cell line (e.g., by electroporation) and 

25 cells in which the introduced DNA has homologously recombined with the endogenous 
DNA are selected (see e.g., Li et al., Cell, 69:915 (1992)). The selected cells are then 
injected into a blastocyst of an animal (e.g., a mouse or rat) to form aggregation chimeras 
(see e.g., Bradley, in Teratocarcinomas and Embryonic Stem Cells: A Practical 
Approach. E. J. Robertson, ed. (IRL, Oxford, 1987), pp. 1 13-152). A chimeric embryo 

30 can then be implanted into a suitable pseudopregnant female foster animal and the 
embryo brought to term to create a knockout animal. Progeny harboring the 
homologously recombined DNA in their germ cells can be identified by standard 
techniques and used to breed animals in which all cells of the animal contain the 
homologously recombined DNA. Knockout animals can be characterized for instance, 
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for their ability to defend against certain pathological conditions and for their 
development of pathological conditions due to absence of the TRAG polypeptide. 

4. Use of TRAG Upstream Control Sequences For Evaluating Neoplastic Processes 
5 The genomic TRAG control sequences of the present invention, whether positive, 

negative, or both, may be employed in numerous various combinations and organizations 
to assess the regulation of TRAG. Moreover, in the context of multiple unit embodiments 
and/or in embodiments that incorporate both positive and negative control units, there is 
no requirement that such units be arranged in an adjacent head-to-head or head-to-tail 

10 construction in that the improved regulation capability of such multiple units is conferred 
virtually independent of the location of such multiple sequences with respect to each 
other. Moreover, there is no requirement that each unit to comprise the same positive or 
negative element. All that is required is that such sequences be located upstream of and 
sufficiently proximal to a transcription initiation site. 

15 To evaluate TRAG regulatory elements in the context of heterologous genes, one 

simply obtains the structural gene and locates one or more of such control sequences 
upstream of a transcription initiation site. Additionally, as is known in the art, it is 
generally desirable to include TATA-box sequences upstream of and proximal to a 
transcription initiation site of the heterologous structural gene. Such sequences may be 

20 synthesized and inserted in the same manner as the novel control sequences. 
Alternatively, one may desire to simply employ the TATA sequences normally 
associated with the heterologous gene. In any event, TATA sequences are most desirably 
located between about 20 and 30 nucleotides upstream of transcription initiation. 

Preferably the heterologous gene is a reporter gene that encodes an enzyme that 

25 produces colorimetric or fluorometric change in the host cell, which is detectable by in 
situ analysis and which is a quantitative or semiquantitative function of transcriptional 
activation. Exemplary enzymes include esterases, phosphatases, proteases (tissue 
plasminogen activator or urokinase), and other enzymes capable of being detected by 
activity that generates a chromophore or fluorophore as will be known to those skilled in 

30 the art. A preferred example is E. coli beta-galactosidase. This enzyme produces a color 
change upon cleavage of the indigogenic substrate indolyl-B-D-galactoside by cells 
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bearing beta-galactosidase {see, e.g., Goring et aL, Scienc e, 235:456-458 (1987) and 
Price et aL, Proc. Natl. Acad. Sci. U.S.A., 84:156-160 (1987)). Thus, this enzyme 
facilitates automatic plate reader analysis of TRAG control sequence mediated 
expression directly in microtiter wells containing transformants treated with candidate 
5 activators. Also, because the endogenous beta-galactosidase activity in mammalian cells 
ordinarily is quite low, the analytic screening system using p-galactosidase is not 
hampered by host-cell background. 

Another class of reporter genes that confer detectable characteristics on a host cell 
are those that encode polypeptides, generally enzymes, which render their transformants 
10 resistant against toxins, e.g., the neo gene, which protects host cells against toxic levels 
of the antibiotic G418; a gene encoding dihydrofolate reductase, which confers resistance 
to methotrexate; or the chloramphenicol acetyltransferase (CAT) gene (Osborne et aL, 
Cell, 42:203-212 (1985)). Genes of this class are not preferred since the phenotype 
(resistance) does not provide a convenient or rapid quantitative output. Resistance to 
antibiotic or toxin requires days of culture to confirm or complex assay procedures if 
other than a biological determination is to be made. 

5. Use of TRAG Polypeptides in Protein-Protein Interaction Studies. 

Co-immunoprecipitation and Gal4 protein-protein interaction assays may be 
useful in screening for compounds modulating TRAG activity or in screening for 
compounds altering TRAG activity in a cell. For example, TRAG may participate with 
other WD repeat element-containing proteins in modulating TGF-p signaling. Those 
skilled in the art will understand that binding of a ligand at a molecular binding site can 
be modulated in a direct manner (e.g., by blocking the site), as well as altered in an 
indirect manner (e.g., by conformational changes induced following binding of a second 
(different) ligand at a distant site). In this regard, it is likely that the binding site 
specificity of TRAG for a particular WD repeat element-containing protein can be 
completely altered (i.e., to bind a different ligand) by agents that bind at distant sites in 
these proteins. A number of exemplary protocols that may be used in these studies are 
known in the art, see e.g. U.S. Patent No. 5,625,031. 
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Those skilled in the art will recognize that the functional regions of WD repeat 
element-containing proteins are particularly attractive targets for three-dimensional 
molecular modeling and for the construction of mimetic compounds, e.g., organic 
chemicals constructed to mimic the three-dimensional interactions between TRAG and 
5 another WD repeat element-containing protein. (See e.g., J. Wang, Curr Opin Gen. Dev , 
7:39 (1997); Y. Taya et aL, Trend Biochem. Sci. , 22:14 (1997)). 

6. Use of TRAG Containing Expression Vectors for Modulating Cellular Phenotype. 
As discussed above, TRAG genes can be incorporated into any standard cloning 

10 vector. The term "vector" is well understood in the art and is synonymous with the 
often-used phrase "cloning vehicle." A suitable vector is a nonchromosomal double- 
stranded DNA comprising an intact replicon such that the vector is replicated when 
placed within a unicellular organism. Viral vectors include retroviruses, adenoviruses, 
adeno-associated virus (AAV), herpes virus, papovirus, adeno-retrovirus, etc. Other 

15 suitable vectors include plasmids. Plasmids and retroviruses are generally preferred as 
vectors. 

As discussed in Example 8, pcDNA3.1-TRAG (pcDNA3.1 purchased from 
Invitrogen) was constructed and contained the CMV promoter. This promoter is suitable 
for expression of the TRAG genes in a wide variety of cells. However, the CMV 

20 promoter is not specifically required for transcription and expression of the TRAG genes. 
The CMV promoter can be replaced with other known promoters to improve the 
efficiency of transcription and expression in particular cells. 

The promoter DNA can be amplified using PCR technology while concurrently 
providing restriction sites at the 5' and 3 1 ends of the promoter DNA. The amplified 

25 promoter DNA can then be inserted into a cloning vehicle (for example pcDNA3.1) 
using conventional endonucleases and known recombinant DNA technology. Cloning 
vectors containing the desired promoter upstream of the 5 1 end of TRAG genes may be 
constructed in this manner. 

Cellular phenotype can be influenced by using the TRAG gene. In this context, 

30 cloning vectors containing an appropriate promoter and the TRAG gene may be 

constructed using PCR technology in a manner analogous to the preparation of vectors 
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containing exogenous genes as is well known in the art. Cloning vectors containing 
TRAG genes are transfected into host cells using known transfection processes. Suitable 
transfection processes include lipofection, electroporation, and retrovirus infection. 
When transfecting cells with TRAG, the desired cells are isolated and cultured in suitable 
5 media. 

Transfection of cells using lipofection may be conducted according to standard 
lipofection procedures. (See Feigner et al., Proc. Natl. Acad. Sci. (USA), 84:74 13-74 1 7 
(1987)). In general, liposome-mediated DNA transfection is accomplished by exposing 
1-20 micrograms of plasmid DNA and commercially available liposomes (Bethesda 

10 Research Laboratories) in culture medium. The transfected cells are then repeatedly 
passaged in culture medium and the desired clones are isolated. 

Retrovirus infection may also be accomplished using previously described 
procedures. (See, for example, Miller et al., J. Virol., 62:4337-4345 and Halbert et al., J. 
Virol., 65:473-378 (1991)). In general, plasmid DNA is transfected into a desired 

15 packaging cell line such as Psi-2 or other cell lines, using standard calcium phosphate 
precipitation. Viruses produced from the Psi-2 cells or equivalent cells are then used to 
infect an amphotropic packaging cell line, for example PA317. Viruses produced by the 
amphotropic packaging cell line are used to infect the desired host parent cells of the 
present invention. 

20 Selection of clones with a modulated phenotype may be undertaken by a variety 

of protocols that are well known in the art (see e.g. U.S. Patent No. 5,376,542). In such 
selections, the cells may be selected for their ability to respond to factors such as TGF-p. 
Alternatively, cells can be selected by their ability to form colonies and grow in soft 
agar. Moreover, the cells can be selected by their ability to form tumors in animal 

25 models such as in nude mice. Similarly, cancer cells or cells having aggressive 
metastasis can be selected. 

After transfection, the desired clones are selected by culturing in optimal media 
and repeated passaging. Generally, 10-20 passages are required to eliminate spurious 
cells and obtain pure clonal cells. Optimal media are selected according to the type of 

30 parent ceil that is utilized. For lymphocytes, RPMI media is preferred; for fibroblasts, 
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DMEM media is preferred; and for epithelial cells, a serum-free medium such as 
keratinocyte growth medium (KGM) or SFM (Gibco Company) is preferred. 

Selected colonies are then tested to verify the presence of TRAG DNA and the 
expression of TRAG genes. Verification is confirmed by standard Southern hybridization 
5 techniques and immunoprecipitation to determine the presence or quantity of expressed 
TRAG proteins. 

7. Chromosomal Localization. 

In Example 2, chromosomal localization of the human and murine TRAG genes 

10 is described. Chromosomal localization was done by FISH analysis (Stokke, T. et al, 
Genomics, 26: 134-137 (1995)). Murine TRAG maps to chromosome 18, and human 
TRAG maps to chromosome 18. 

In other embodiments the invention provides diagnostic assays for determining 
chromosomal rearrangement of TRAG genes in a cell. The chromosomal location of 

15 TRAG genes is conveniently determined in chromosomal smears by in situ hybridization 
with oligonucleotide probes or cDNA and the like. Translocation of a TRAG gene, i.e., 
from a chromosomal location found in a normal cell to a location found in a transformed 
cell, may contribute to a phenotype of uncontrolled cell growth by removing normal 
transcription regulatory control of expression of TRAG. In the case where the 

20 rearrangement induces overexpression, the cell may acquire a malignant (i.e., 

uncontrolled) growth phenotype, and in the case where the rearrangement induces 
underexpression, the cell may undergo premature senescence. Screening cellular 
samples from individuals for the potential of TRAG chromosomal rearrangement may 
indicate a relative risk factor for the possibility of developing cancer, 

25 

E. Anti-TRAG Antibodies 

The present invention further provides anti-TRAG antibodies. Exemplary 
antibodies include polyclonal, monoclonal, humanized, bispecific, and heteroconjugate 
antibodies. 

30 

1. Polyclonal Antibodies 

The TRAG antibodies may comprise polyclonal antibodies. Methods of 
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preparing polyclonal antibodies are known to the skilled artisan. Polyclonal antibodies 
can be raised in a mammal, for example, by one or more injections of an immunizing 
agent and, if desired, an adjuvant. Typically, the immunizing agent and/or adjuvant will 
be injected in the mammal by multiple subcutaneous or intraperitoneal injections. The 
5 immunizing agent may include the TRAG polypeptide or a fusion protein thereof. It may 
be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being immunized. Examples of such immunogenic proteins include but are 
not limited to keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, and 
soybean trypsin inhibitor. Examples of adjuvants that may be employed include Freund's 
complete adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic 
trehalose dicorynomycolate). Further, polyclonal antibodies may be generated 
commercially, for example by Genemed Synthesis, Inc. using art-accepted methods. 

2. Monoclonal Antibodies 

The TRAG antibodies may, alternatively, be monoclonal antibodies. Monoclonal 
antibodies may be prepared using hybridoma methods, such as those described by Kohler 
and Milstein, Nature. 256:495 (1975). In a hybridoma method, a mouse, hamster, or 
other appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically 
bind to the immunizing agent. Alternatively, the lymphocytes may be immunized in 
vitro. 

The immunizing agent will typically include the TRAG polypeptide or a fusion 
protein thereof. Generally, either peripheral blood lymphocytes (TBLs") are used if 
cells of human origin are desired, or spleen cells or lymph node cells are used if 
nonhuman mammalian sources are desired. The lymphocytes are then fused with an 
immortalized cell line using a suitable fusing agent, such as polyethylene glycol, to form 
a hybridoma cell (Goding, Monoclonal Antibodies: Principles and Practice, Academic 
Press, (1986) pp. 59-103). Immortalized cell lines are usually transformed mammalian 
cells, particularly myeloma cells of rodent, bovine, and human origin. Usually, rat or 
mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the 
growth or survival of the unfused, immortalized cells. For example, if the parental cells 
lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), 
the culture medium for the hybridomas typically will include hypoxanthine, aminopterin, 
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and thymidine ( n HAT medium 11 ), which are substances that can prevent the growth of 
HGPRT-deficient cells. 

Preferred immortalized cell lines are those that fuse efficiently, support stable 
high level expression of antibody by the selected antibody-producing cells, and are 
5 sensitive to a medium such as HAT medium. More preferred immortalized cell lines are 
murine myeloma lines, which can be obtained, for instance, from the Salk Institute Cell 
Distribution Center, San Diego, California and the American Type Culture Collection, 
Manassas, Virginia. Human myeloma and mouse-human heteromyeloma cell lines also 
have been described for the production of human monoclonal antibodies (Kozbor, J., 
Immunol.. 133:3001 (1984); Brodeur et al., Monoclonal Antibody Production 
Techniques and Applications, Marcel Dekker, Inc., New York, (1987) pp. 51-63). 

The culture medium in which the hybridoma cells are cultured can then be 
assayed for the presence of monoclonal antibodies directed against TRAG. Preferably, 
the binding specificity of monoclonal antibodies produced by the hybridoma cells is 
determined by immunoprecipitation or by an in vitro binding assay, such as 
radioimmunoassay (RIA) or enzyme-linked immunoabsorbent assay (ELISA). Such 
techniques and assays are known in the art. The binding affinity of the monoclonal 
antibody can, for example, be determined by the Scatchard analysis of Munson and 
Pollard, Anal. Biochem., 107:220 (1980). 

After the desired hybridoma cells are identified, the clones may be subcloned by 
limiting dilution procedures and grown by standard methods (Goding, supra) . Suitable 
culture media for this purpose include, for example, Dulbecco's Modified Eagle's 
Medium and RPMI-1640 medium. Alternatively, the hybridoma cells may be grown in 
vivo as ascites in a mammal. The monoclonal antibodies secreted by the subclones may 
be isolated or purified from the culture medium or ascites fluid by conventional 
immunoglobulin purification procedures such as, for example, protein A-Sepharose, 
hydroxylapatite chromatography, gel electrophoresis, dialysis, or affinity 
chromatography. The monoclonal antibodies may also be made by recombinant DNA 
methods, such as those described in U.S. Patent No. 4,816,567. DNA encoding the 
monoclonal antibodies of the invention can be readily isolated and sequenced using 
conventional procedures (e.g., by using oligonucleotide probes that are capable of 
binding specifically to genes encoding the heavy and light chains of murine antibodies). 
The hybridoma cells of the invention serve as a preferred source of such DNA. Once 
isolated, the DNA may be placed into expression vectors, which are then transfected into 
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host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or myeloma 
cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
monoclonal antibodies in the recombinant host cells. The DNA also may be modified, 
for example, by substituting the coding sequence for human heavy and light chain 
5 constant domains in place of the homologous murine sequences (U.S. Patent No. 

4,816,567) or by covalently joining to the immunoglobulin coding sequence all or part of 
the coding sequence for a nonimmunoglobulin polypeptide. Such a nonimmunoglobulin 
polypeptide can be substituted for the constant domains of an antibody of the invention 
or can be substituted for the variable domains of one antigen-combining site of an 

10 antibody of the invention to create a chimeric bivalent antibody. 

The antibodies may be monovalent antibodies. Methods for preparing 
monovalent antibodies are well known in the art. For example, one method involves 
recombinant expression of immunoglobulin light chain and modified heavy chain. The 
heavy chain is truncated generally at any point in the Fc region so as to prevent heavy 

1 5 chain crosslinking. 

Alternatively, the relevant cysteine residues are substituted with another amino 
acid residue or are deleted so as to prevent crosslinking. In vitro methods are also 
suitable for preparing monovalent antibodies. Digestion of antibodies to produce 
fragments thereof, particularly, Fab fragments, can be accomplished using routine 

20 techniques known in the art. 

3. Humanized Antibodies 

The TRAG antibodies of the invention may further comprise humanized 
antibodies or human antibodies. Humanized forms of nonhuman (e.g., murine) 

25 antibodies are chimeric immunoglobulins, immunoglobulin chains, or fragments thereof 
(such as Fv, Fab, Fab', F(ab')2, or other antigen-binding subsequences of antibodies) that 
contain minimal sequence derived from nonhuman immunoglobulin. Humanized 
antibodies include human immunoglobulins (recipient antibody) in which residues from a 
complementary-determining region (CDR) of the recipient are replaced by residues from 

30 a CDR of a nonhuman species (donor antibody) such as mouse, rat, or rabbit having the 
desired specificity, affinity, and capacity. In some instances, Fv framework residues of 
the human immunoglobulin are replaced by corresponding nonhuman residues. 
Humanized antibodies may also comprise residues that are found neither in the recipient 
antibody nor in the imported CDR or framework sequences. In general, the humanized 
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antibody will comprise substantially all of at least one, and typically two, variable 
domains, in which all or substantially all of the CDR regions correspond to those of a 
nonhuman immunoglobulin and all or substantially all of the FR regions are those of a 
human immunoglobulin consensus sequence. The humanized antibody optimally also 

5 will comprise at least a portion of an immunoglobulin constant region (Fc), typically that 
of a human immunoglobulin (Jones et al, Nature. 321 :522-525 (1986); Riechmann et 
al., Nature . 332:323-329 (1988); and Presta, Curr. Op. Struct BioL 2:593-596 (1992)). 

Methods for humanizing nonhuman antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into it 

10 from a source that is nonhuman. These nonhuman amino acid residues are often referred 
to as "import" residues, which are typically taken from an "import" variable domain. 
Humanization can be essentially performed following the method of Winter and 
coworkers (Jones et al., Nature. 321:522-525 (1986); Riechmann et al., Nature. 
332:323-327 (1988); Verhoeyen et al. Science, 239:1534-1536 (1988)), by substituting 

15 rodent CDRs or CDR sequences for the corresponding sequences of a human antibody. 
Accordingly, such "humanized" antibodies are chimeric antibodies (U.S. Patent No. 
4,8 1 6,567), wherein substantially less than an intact human variable domain has been 
substituted by the corresponding sequence from a nonhuman species. In practice, 
humanized antibodies are typically human antibodies in which some CDR residues and 

20 possibly some FR residues are substituted by residues from analogous sites in rodent 
antibodies. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries (Hoogenboom and Winter, J. Mol. Biol. , 227:381 
(1991); Marks et al., J. Mol. Biol. . 222:581 (1991)). The techniques of Cole et al. and 
25 Boerner et al. are also available for the preparation of human monoclonal antibodies 

(Cole et al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and 
Boerner et al. . J. Immunol, 147(l):86-95 (1991)). 

Humanized antibodies can also be prepared according to the methods disclosed 
by, for example, U.S. Patent Nos. 5,175,384; 5,434,340; 5,545,806; 5,569,825; 
30 5,591,669; 5,625,126; 5,633,425; 5,916,771; and 5,589,369. 



4. Bispecific Antibodies 

Bispecific antibodies are monoclonal, preferably human or humanized, antibodies 
that have binding specificities for at least two different antigens. In the present case, one 
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of the binding specificities is for the TRAG, the other one is for any other antigen, and 
preferably for a cell-surface protein or receptor or receptor subunit. Methods for making 
bispecific antibodies are known in the art. Traditionally, the recombinant production of 
bispecific antibodies is based on the co-expression of two immunoglobulin heavy- 
5 chain/light-chain pairs, where the two heavy chains have different specificities (Milstein 
and CueUo, Nature, 305:537-539 (1983)). Because of the random assortment of 
immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by 

10 affinity chromatography steps. Similar procedures are disclosed in WO 93/08829, 
published 13 May 1993, and in Traunecker et aL, EMBOJ.. 10:3655-3659 (1991). 

Antibody variable domains with the desired binding specificities (antibody- 
antigen combining sites) can be fused to immunoglobulin constant domain sequences. 
The fusion preferably is with an immunoglobulin heavy-chain constant domain, 

15 comprising at least part of the hinge, CH2, and CH3 regions. It is preferred to have the 
first heavy-chain constant region (CHI) containing the site necessary for light-chain 
binding present in at least one of the fusions. DNAs encoding the immunoglobulin 
heavy-chain fusions and, if desired, the immunoglobulin light chain, are inserted into 
separate expression vectors and are cotransfected into a suitable host organism. For 

20 further details of generating bispecific antibodies see, for example, Suresh et al., Methods 
inEnzymology. 121:210 (1986). 

5. Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 

25 Heteroconjugate antibodies are composed of two covalently joined antibodies. Such 
antibodies have, for example, been proposed to target immune system cells to unwanted 
cells (U.S. Patent No. 4,676,980) and for treatment of HIV infection (WO 91/00360; WO 
92/200373; EP 03089). It is contemplated that the antibodies may be prepared in vitro 
using known methods in synthetic protein chemistry, including those involving 

30 crosslinking agents. For example, immunotoxins may be constructed using a disulfide 
exchange reaction or by forming a thioether bond. Examples of suitable reagents for this 
purpose include iminothiolate and methyl-4-mercaptobutyrimidate and those disclosed, 
for example, in U.S. Patent No. 4,676,980. 
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F. Uses for TRAG Antibodies 

The TRAG antibodies of the invention have various utilities. For example, 
TRAG antibodies may be used in diagnostic assays for TRAG, e.g., detecting its 
expression in specific cells, tissues, or serum. Various diagnostic assay techniques 
5 known in the art may be used, such as competitive binding assays, direct or indirect 
sandwich assays, and immunoprecipitation assays conducted in either heterogeneous or 
homogeneous phases (Zola, Monoclonal Antibodies: A Manual of Techniques. CRC 
Press, Inc. (1987) pp. 147-158). The antibodies used in the diagnostic assays can be 
labeled with a detectable moiety. The detectable moiety should be capable of producing, 

10 either directly or indirectly, a detectable signal. For example, the detectable moiety may 
be a radioisotope, such as 3 H, 14 C 5 32 P, 35 S, or 125 I, a fluorescent or chemiluminescent 
compound, such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, 
such as alkaline phosphatase, beta-galactosidase or horseradish peroxidase. Any method 
known in the art for conjugating the antibody to the detectable moiety may be employed, 

15 including those methods described by Hunter et al., Nature, 144:945 (1962); David et 
al, Biochemistry . 13:1014 (1974); Pain et al., J. Immunol Meth.. 40:219 (1981); and 
Nygren, J. Histochem. and Cvtochem. . 30:407 (1982). In addition, TRAG or antibodies 
that recognize TRAG may be used in drug screening assays to identify compounds that 
act to positively or negatively modulate the function of TRAG. 

20 The antibodies can also be TRAG antagonists or agonists. Antibodies may also 

be useful therapeutically either alone, as agents that would act directly to interfere with 
the function of TRAG or indirectly as targeting agents capable of delivering a toxin, for 
example, pseudomonas exotoxin or radioisotopes, conjugated thereto to a desired site. 
TRAG antibodies also are useful for the affinity purification of TRAG from 

25 recombinant cell culture or natural sources. In this process, the antibodies against TRAG 
are immobilized on a suitable support, such a Sephadex resin or filter paper, using 
methods well known in the art. The immobilized antibody then is contacted with a 
sample containing the TRAG to be purified, and thereafter the support is washed with a 
suitable solvent that will remove substantially all the material in the sample except the 

30 TRAG, which is bound to the immobilized antibody. Finally, the support is washed with 

another suitable solvent that will release the TRAG from the antibody, 
***************************** 

The following examples are offered for illustrative purposes only and are not 
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intended to limit the scope of the present invention in any way. All patent and literature 
references cited in the present specification are hereby incorporated by reference in their 
entirety. 

5 EXAMPLES 

Commercially available reagents referred to in the examples were used according 
to manufacturers instructions unless otherwise indicated. The source of those cells 
identified in the following examples, and throughout the specification, by ATCC 
accession numbers is the American Type Culture Collection, 10801 University 
10 Boulevard, Manassas, Virginia, US A. 

EXAMPLE 1 

Isolation of Rat TRAG cDNA and Determination of Tissue Expression 
Loss of TGF-P 1 induced growth inhibition is an early event during spontaneous 

15 transformation of RLE cells. (A.C. Hugget et al, Cancer Res. , 51 :5929 (1991)). This 
resistance to the growth inhibitory effects of TGF-P 1 can be caused by multiple factors. 
(E.R. Barrack, Prostate , 31:61 (1997); R.W. Padgett et al., Cytokine Growth Factor Rev. , 
8:1 (1997); L. Attisano, et al., Cytokine Growth Factor Rev. , 7:327 (1996)). However, 
we observed a number of the transformed cell lines displaying resistance to the growth 

20 inhibitory effects of TGF-P 1 that apparently had a "normal" number of TGF-p receptors. 
(A.C. Hugget et al., Cancer Res. , 51:5929 (1991)). These observations suggested a post- 
receptor disruption of the growth inhibitory signal(s) of TGF-P 1 . To search for 
endogenous genes that confer resistance to the growth inhibitory effects of TGF-J31, and 
which in turn may lead to cellular transformation, subtractive hybridization (J.T. 

25 Woitach et al., Nat. Genet. , 19:371 (1998)) was done on two rat liver epithelium (RLE)- 
derived cell lines that were sensitive (RLE phi 13) or insensitive (B5T) to TGF-p. (For a 
discussion about the production of RLE phi 13 and B5T cell lines, see A.C. Huggett et 
al, Cancer Research , 51:5929 (1991)). B5T designates a cell line derived from an RLE 
parent line by multiple passages (i.e. allowing cells to grow to fill a culture plate 

30 (confluency), and then diluting these cells into a fresh plate and allowing them to once 
again reach confluency). Unlike RLE phi 13 cells, B5T cells are insensitive to TGF-p 
and exhibit a transformed phenotype. 
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A novel transcript, TRAG (TGF-p Resistance Associated Gene), was identified 
and shown to be overexpressed in the B5T cells as well as one other transformed RLE 
line (C4T) derived in an identical way to B5T (A.C. Huggett et al., Cancer Research , 
51:5929 (1991)), both of which are resistant to the effects of TGF-pi (Figure 1). 

5 

Isolation of Rat TRAG cDNA 

To isolate rat TRAG cDNA, poly-A* was first isolated from RLE phi 13 cells 
and B5T cells by using oligo (dt) cellulose, as previously described (Bradley JE, Bishop 
GA, St John T, Frelinger JA, Biotechniques, 6:1 14-1 16 (1988)). The cDNAs were 

10 synthesized using Superscript reverse transcriptase (BRL) as recommended by the 
vendor; added with BstXI adaptor (InVitrogen); and used to produce RLE phi 13 and 
B5T cDNA libraries in pcDNAIneo. 

For B5T cDNA enriched subtractive cDNA library construction, B5T and RLE 
phi 13 cDNAs were digested with Hindlll+Xbal and BamHI+XhoI, respectively, and 

15 subjected to agarose gel purification. Digested RLE phi 13 cDNA fragments (20 jxg) 
were digested further with Alul+Rsal and dephosphorylated, then hybridized with 
Hindlll+Xbal digested B5T cDNA fragments (0.4 |ag). The hybridization mixture was 
used to construct the B5T cDNA enriched subtractive cDNA library in BluescriptM13 
with Hindlll+Xbal protruding termini. The enriched library was plated and single 

20 colonies were isolated. The plasmid from each clone/colony was sequenced, and novel 
sequences were used to screen a panel of rat tumor samples and synchronized cells. A 
0.8kb cDNA clone encoding part of the 3' untranslated region (3'UTR) of the TRAG 
gene was isolated. This clone was picked for its overexpression in B5T cells, relative to 
RLE cells, following screening by Northern hybridization techniques, and for its novel 

25 sequence. The single similar (homologous) sequence that was found in the sequence 
databases represented an uncharacterized cDNA clone from human brain, identified by 
the name KIAA0541 (GenBank Accession Number AB01 1113). Using primers designed 
against this sequence, and making use of reverse transcriptase (RT)-PCR and sequencing, 
the remainder of the 3' untranslated region and part of the coding region were isolated 

30 and characterized from rat (Table 1). RT-PCR describes a molecular biology technique 
by which isolated single-stranded complementary DNA (cDNA) is produced from total 
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mRNA or poly-A+ RNA using the Reverse Transcriptase enzyme (Superscript II from 
Gibco, BRL Life Technologies). This cDNA can then be used as a template for 
amplification in a PCR reaction. The template for RT-PCR was total RNA isolated from 
B5T cells (derived from RLE parent cells) or fresh brain tissue from rat. 
5 The remainder of the coding region of rat TRAG, in particular, ~2kb of the 5 f 

upstream region (including the transcription and translation start sites) was obtained 
using the RACE (random amplification of cDNA ends) technique, as described in the 
manufacturers instructions accompanying the SMART® RACE kit (Clontech, Palo Alto, 
CA). The human TRAG sequence homologous to this region was not present in the 

10 KIAA0541 clone. 

Sequence specific to rat brain TRAG cDNA was isolated as described above from 
cDNA obtained by RT (reverse transcriptase) experiments using total RNA extracted 
from Fisher F344 rat brain tissue (Table 1 , bases 2876-2968). 

The final size of the coding region for rat TRAG cDNA was 4,464 base pairs, 

15 encoding a protein of 1,488 amino acids. 

Plasmids and PCR products were sequenced using the BigDye Terminator Ready 
Reaction Kit (cat# 4303 149, Perkin-Elmer Applied Biosystems). The reactions were 
purified using Centriflac Gel Filtration Columns (cat# 42453, Edge Biosystems, Inc.). 
The samples were analyzed on a 377 ABI Prism DNA sequencer (Perkin-Elmer Applied 

20 Biosystems) to determine the primary nucleotide sequence. The predicted amino acid 
sequence having 1 ,488 amino acids (with a predicted Mr ~ 1 65,000 Daltons) was then 
derived using PCGENE. The derived amino acid sequence is indicated by the 3-letter 
code below the corresponding bases in Table 1 . 

25 Isolation of Mouse TRAG cDNA 

The mouse TRAG gene was isolated by RT-PCR and RACE-PCR as described 
above for the rat TRAG gene, except that the primers used were designed against the rat 
TRAG sequence (which is closer to mouse TRAG that human is). Brain-specific 
sequence (Table 3, bases 2876-2971) was obtained as outlines above, using brain tissue 

30 from the SvJ129 mouse strain. Mouse genomic clones were isolated by screening a 
mouse B AC genomic libraries from the SvJ strain of mouse (performed by Genome 
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Systems, Inc.). The template for RT-PCR was total RNA isolated from fresh liver or 
brain for mouse. The template derived from mouse brain was used to sequence cDNA 
specific to brain tissue, which contained an additional coding region not found in the 
liver. The template derived from mouse liver was used to sequence cDNA specific to 
5 liver, which was shown to lack that coding region present in brain tissue. 

The mouse cDNA sequence was determined as described above for rat cDNA. A 
. cDNA clone having 4,467 base pairs was generated from mouse by RT-PCR and RACE- 
PCR (Table 3). 

The predicted amino acid sequence having 1,489 amino acids (with a predicted 
10 Mr - 165,000 Daltons) was then derived using PCGENE. The derived amino acid 
sequence is indicated by the 3-letter code below the corresponding bases in Table 3. 

Isolation of Human TRAG cDNA 

The human TRAG gene was isolated from human poly-A + RNA obtained 

15 commercially from Clontech (cat# 65 1 0-1 , Palo Alto, CA) by RT-PCR, as described 
above, but using human-specific oligonucleotide primers which were obtained from 
sequence data deposited in GenBank for KIAA0541 (Accession Number AB01 1 1 13). 
These primers were chosen because this GenBank sequence showed homology to the 
3'UTR of the cDNA clone derived from the B5T cell described above for the isolation of 

20 the rat TRAG sequence. However, this human sequence (KIAA0541) was incomplete, 
lacking the 5' ~1.5 to ~2kb of the TRAG gene, including the transcription and translation 
start sites. The sequence of this missing region was determined by searching GenBank 
with the relevant mouse sequence (i.e. the first ~2kb of the mouse TRAG gene). This 
search yielded an uncharacterized genomic clone from chromosome 18 (containing both 

25 intronic and exonic sequence from the 5'-most region of the TRAG gene). By comparing 
mouse TRAG cDNA to this human genomic sequence, it was possible to detect the 
human exonic sequences and assemble them into a recognizable cDNA sequence. The 
entire human TRAG sequence was confirmed using RT-PCR of human poly-A + RNA 
(see above) and sequencing. The exon specific to brain, which was present in the 

30 KIAA0541 sequence, can be found in Table 2, bases 2924-3022. 
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A cDNA clone having 4,470 base pairs was generated from human by RT-PCR 
(Table 2). The predicted amino acid sequence having 1,490 amino acids (with a predicted 
Mr ~ 165,000 Daltons) was then derived using PCGENE. The derived amino acid 
sequence is indicated by the 3-letter code below the corresponding bases in Table 2. 
5 The comparison of the cDNA sequence of TRAG from human, mouse, and rat 

indicates that TRAG is a highly conserved gene in all of these species, as is the amino 
acid sequence (Table 4). 

Characterization of TRAG cDNA and Protein 

10 DNA homology searches of GenBank revealed TRAG to be a unique transcript. 

Yet some homologous sequences were identified. A cDNA clone from human brain 
(KIAA0541, GenBank Accession No. AB01 1 113) having approximately 87% nucleotide 
homology with the mouse TRAG gene was identified. A putative Drosophila G-protein 
subunit (GenBank Accession No. AL021086) having approximately 52% homology with 

15 the mouse TRAG protein was also identified using the BLAST search program. This 
homology is shown in Table 5. We compared the deduced amino acid sequence of 
TRAG with known proteins in the Swissprot database. TRAG contains no domains 
similar to other proteins. However, the deduced protein sequence does contain 4 WD 
repeat motifs (Tables 1, 2 and 3, underlined). 

20 Based on the identification of the WD repeat motifs, TRAG may interact with the 

TGF-p pathway through these WD repeat motifs in a way similar to TRIP-1 and 
STRAP. Considering the means by which TRAG was isolated (i.e. by virtue of its 
overexpression in TGF-[3 resistant cells), it could play an important role in the TGF-ji 
pathway and upregulation may interfere with normal TGF-(5 signaling. TRAG may 

25 interact directly with one of the TGF-p receptors and/or with another WD repeat- 
containing protein in the pathway, for example TRIP-1 and/or STRAP. We have already 
demonstrated an association of TRAG overexpression with a loss of TGF-p sensitivity 
and with an aggressive and malignant phenotype (see Examples below). 
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TRAG Expression 

The expression of TRAG in three types of rat cells (B5T, RLE phi 13, and C4T) 
was analyzed by Northern blot (Figure 1 A) to determine whether any difference in 
expression existed in the transformed (TGF-p resistant) cell lines relative to the 
5 nontransformed (TGF-p sensitive) RLE phi 13 cell line. 

To determine TRAG expression levels in certain rat cells, RNA was first isolated 
from 3 rat cell lines (transformed B5T, transformed C4T, and untransformed RLE cells) 
(A.C. Huggett, et al., Cancer Res. , 51:5929 (1991)), poly-A + mRNA selected using oligo 
(dt) cellulose (Bradley JE, Bishop GA, St John T, Frelinger JA (1988), Biotechniques, 
6: 1 14-116), and 5 |jg (10 ^g of total RNA) was then fractionated under denaturing 
conditions by electrophoresis on a 1% agarose gel, containing formaldehyde (0.22 M) 
and lx MOPS (50 mM 3-[N-morpholino] phopanesulfonic acid and ImM EDTA). 
Samples were electrophoresed for about 3 hours at 80V, blotted overnight onto 
nitrocellulose membranes, and cross-linked under ultraviolet light. Membranes were 
prehybridized in QuikHyb (Stratagene, La Jolla, CA) for 15 minutes at 68°C, labeled 
probe (0.8kb dsDNA against part of the 3TJTR; about lxlO 6 cpm/ml) was added, and the 
membrane and probe were incubated for about 60 minutes at 68°C. The membrane was 
washed and then exposed to phosphoimager plates for 48 hours. The results are shown 
in Figure 1 A. 

The expressed TRAG levels were then quantified (Figure IB). The results 

indicate that TGF-p resistant B5T cells express higher levels (up to 7 fold) of TRAG 
than do RLE phi 13 cells. 

EXAMPLE 2 

Chromosomal Localization of TRAG in Mouse and Human 

Chromosomal localization of TRAG in mouse and human was done by FISH 
analysis (Stokke, T. 5 Collins, C, Kuo, W., Kowbel, D., Shadrvan, F., Tanner, M., 
Kallionienmi, A., Kallioniemi, O., Pinkel, D., Deaven, L., and Gray, J • A physical map 
of chromosome 20 established using fluorescent in situ hybridization (FISH) and digital 
image analysis, Genomics, 26: 134-137 (1995)). A FISH analysis generally involves 
probing chromosomes with a labeled DNA. The probes used to localize TRAG on 
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mouse and human chromosomes were prepared by labeling purified mouse and human 
DNA, respectively, with biotin (Random-Prime labeling kit: Boehringer-Mannheim 
Corp., Indianapolis, IN). 

To localize TRAG on the mouse chromosome, a unique BAC clone (BAC21521) 
5 isolated from a 129SvJ mouse kidney library was used as a probe. This BAC clone 
was isolated by using a probe to the 3' untranslated region (UTR) of the rat TRAG gene 
(screening performed commercially by Genome Systems, Inc., St. Louis, MO). 

To localize TRAG on the human chromosome, a ~5kb DNA probe 
complementary to the central portion of the human TRAG gene (exons 12-14, including 
10 intervening introns), obtained by PCR amplification of human genomic DNA, was used. 

The biotin-labeled DNA probe was hybridized in situ to chromosomes derived 
from normal methotrexate-synthronized peripheral leukocyte cultures for human and to 
chromosomes derived from normal spleen cells in the case of mouse. Overall, forty-eight 
cells were studied. 

15 The conditions used for hyridization, processing, analysis, direct flourescent 

signal localization, and banded chromosomes have been described previously in detail 
(Zimonjic et al., Cytogenet Cell Genet, 65:184 (1994)). 

The TRAG gene was localized to chromosome 1 8D. 1HB.3 in the mouse (see 
Figure 2A) and to chromosome 18q21.1-22 in the human (see Figure 2B). 

20 The long arm of human chromosome 18, designated 1 8q, and particularly the 

distal end of that region, has been found to be involved in a number of cancers and 
developmental disorders. The 18q region contains a number of genes which have been 
shown to play an integral role in various tumorigenic pathways by deletion studies. 
These genes include DGC (deleted in colorectal cancer) (L. Hendrick, et al., Genes Dev. , 

25 8:1 174 (1994)), BCL-2 (Y Tsujimito et al., Science, 228:1440 (1985)), and DPC4 
(deleted in pancreatic cancer) (SA Hahn et al. . Science. 271 :350 (1996)). TRAG sits 
between DCC and BCL-2 on chromosome 18q, a region that has also been shown to 
potentially be involved in a chromosomal rearrangement-associated mucosa-associated 
lymphoid tissue (MALT) lymphomas, (A Stoffel et al., Genes Chromosomes Cancer. 

30 24:156(1999)). 

Inherited monosomy of 18q (i.e. loss of only one allele, so-called "18q~ 
syndrome") has been associated with a variety of developmental and physiological 
defects, particularly those of the brain. (AD Kline et al.. Am. J. Hum. Genet. , 52:895 
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(1993)). Associated with 18q~ syndrome are various dysmorphic features as well as a 
string of neurological and other brain abnormalities, including microcephaly and 
abnormal MRI. (AD Kline et al., Am. J. Hum. Genet, 52:895 (1993)). The severity of 
defects appears to be correlated with the extent of the 18q deletion involved. For 
5 example, patients studied with deletions in the region of the TRAG gene showed more 
severe defects, particularly of brain development, than those carrying deletions that did 
not incorporate this area of the chromosome. (AD Kline et al., Am. J. Hum. Genet. , 
52:895 (1993)). This circumstantial evidence, coupled with the data showing the highest 
levels of TRAG expression in the brain (see Example 3), leads us to conclude that 
beyond a role in carcinogenesis, the TRAG gene may play a vital role in brain and mental 
development. 

EXAMPLE 3 
Determination of Expression of TRAG 
To determine the normal expression of TRAG, a commercial multiple tissue 
RNA blot from rat (Clontech, Palo Alto, CA) was probed, utilizing Northern blotting 
techniques, with a radioactively labeled TRAG probe as described in Example 1 . 

The results are shown in Figure 3 A. Figure 3 A shows TRAG mRNA having a 
transcript size of 7.2 kb in spleen, lung, liver, muscle, and kidney, and TRAG mRNA 
having a transcript size of 3.5 kb in the testis. TRAG mRNA having a variety of 
transcript sizes (arrow heads in Figure 3 A) was identified in brain suggesting alternative 
splicing. 

Other TRAG expression tests were conducted using mouse and human dot blot 
membranes (Clontech, Palo Alto, CA), utililizing the identical Northern blot analysis 
protocol as detailed in Example 1, except that different probes were used. For human 
tissues, a probe of approximately 1.5 kb of human TRAG from the 5' end of the gene was 
used, and for mouse tissues, a probe of approximately 2.5 kb of mouse TRAG from the 
center of the gene was used. 
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Figure 3B shows the results for expression in human and Figure 3C shows the 
results for expression in mouse. Although the commercial multiple tissue RNA blot 
from rat did not show TRAG mRNA in heart (Figure 3 A), the dot blot corresponding to 
expression in human did show mRNA for TRAG in heart (Figure 3B). The dot blot 
5 showed the most abundant TRAG mRNA expression in brain (Figures 3B and 3C). The 
dot blot also showed TRAG mRNA in all fetal tissues as well as all tissues during 
embryonal development (Figures 3B and 3C). 

EXAMPLE 4 

10 Analysis of TRAG Protein 

To evaluate the expression of TRAG protein, protein extracts from three rat cell 
lines were studied (RLE phi 13, B5T, and C4T) by Western blot. Protein (40 jig) from 
each of the cell lines was collected by lysis of cells in RIP A lysis buffer (1% Tergitol NP- 
40, 0.5% sodium deoxycholate, 0.1% SDS, in IX PBS). Inhibitors were added just 
15 before use (Complete, Mini; Boehringer-Mannheim, Burlingame, CA) according to 
manufacturer's instructions. Lysis buffer was added to plates after a brief wash with IX 
PBS (containing inhibitors), cells were scraped from the plate and placed on ice for 5 
minutes. After passing lysate though a 22 gauge syringe needle, samples were 
centrifuged at 4°C and 14,000g for 30 minutes. The supernatant was collected as crude 
20 extract and the protein concentration determined using the Bio-Rad Protein Assay (Bio- 
Rad, Hercules, CA). This protein (30(ig to 40^g) was electrophoresed for 2.5h, 80V at 
room temperature on a polyacrylamide gel using aNovex XCell II Mini-cell System 
(Novex, San Diego, CA), according to manufacturers instructions, and then blotted onto 
nitrocellulose (25V, 2 hours). The nitrocellulose was incubated in a solution of 5% non- 
25 fat milk powder (NFMP), 3% BSA (bovine serum albumin), and 0.5% Tween-20 in 
Tris-buffered saline (TBS) for 1 hour at room temperature, washed 3 times in TBS + 
0.05% Tween-20 (TBS-T), and placed in a solution of 5% NFMP, 3% BSA, and TBS- 
T (l°/2° Antibody Solution). To this was added the primary rabbit anti-TRAG antibody 
(1 :1 000 dilution; production of antibody described in Example 7) and the membrane 
30 incubated overnight at 4°C. After 3 TBS-T washes, secondary antibody (anti-rabbit 
horse radish peroxidase (HRP)-conjugated IgG; Santa Cruz Biotechnology, Santa Cruz, 
CA) was applied to the membrane in the same l°/2° Antibody Solution used for the 
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primary antibody. Incubation was one hour at room temperature. After 3 TBS-T 
washes, the membrane was placed in ECL solution (Amersham, Arlington Heights, IL) 
for 1 minute, according to the supplied protocol, followed by exposure to Kodak X- 
Omat ARiilm for between 30 seconds and 2 minutes. The primary polyclonal anti- 
5 TRAG antibody that was used was raised in a rabbit against a peptide from the TRAG 
protein. This was done commercially by Genemed Biotechnologies Inc., San Francisco, 
CA, as described in Example 7. 

The results are shown in Figure 4. Figure 4 shows TRAG protein expressed in all 
three cell lines. Yet B5T and C4T show greater amounts of protein than RLE phi 13. 
10 The TRAG protein appears to be about 100 kDa to about 105 kDa. Although this 
molecular weight is not immediately compatible with a coding sequence of 
approximately 4.4 kb (see Table 1), it is believed that alternative splicing produces a 
transcript of between about 3.3 kb and about 3.5 kb. 

15 EXAMPLE 5 

Determination of the Status of TRAG in Tumor Cell Lines 
To determine the status of TRAG in tumor cell lines, protein extract and mRNA 
extract from a variety of tumor cell lines were studied by Western blot and Northern blot, 
respectively. In one study, protein extract and mRNA extract from cells that were 

20 chemically transformed using aflatoxin or virally transformed were examined. The 
chemically transformed cells were AFL-B8 and AFL-D8, and the virally transformed 
cells were 361 1T2, 361 1T5, and J2-14. These cells were produced in our laboratory 
according to JB McMahon et al., Cancer Res. , 46:4665 (1986) and AC Huggett et al., 
Cancer Res. , 51:5929 (1991) . The levels of TRAG protein and mRNA in these cells 

25 were compared to levels of TRAG mRNA and protein in RLE phi 1 3 and B5T cells. 

To determine TRAG protein level in these cells, a Western blot was conducted. 
Protein (40 jag) was isolated from each of the cell lines as described above in Example 4. 
The isolated protein was electrophoresed on a polyacrylamide gel and blotted onto 
nitrocellulose according to the methods described in Example 4. The nitrocellulose was 

30 then probed with the polyclonal antibody (1 : 1,000 dilution) according to Example 4. 
The results are shown in Figure 5 A. The level of TRAG protein correlated with the 
aggressiveness and metastatic ability of the particular cell lines. For example, J2-14, 
which is a highly aggressive metastatic tumor cell line transformed with c-raf and v- 
myc, shows the highest level of TRAG protein, while 361 1T2, which is metastatic, shows 
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a level of TRAG protein that is greater than 361 1T5, which is nonmetastatic. 
Aggressiveness and metastatic ability were determined by observation of tumors 
following injection into nude mice (PJ Worland, et al., Mol. Carcinog .. 3:20-29 (1990)). 

To determine TRAG mRNA level in these cells, a Northern blot was conducted, 
as described above in Example 1. The results are shown in Figure 5 A. 

In another study, protein extract and mRNA extract from tumor cell lines derived 
from double transgenic c-myc/TG¥-a mice (i.e., 221T1, 223B.3, 241T1, 241T3, and 
263B. 1) were evaluated. To determine TRAG protein level in these cells, a Western blot 
was conducted as described above. In this blot, the cancer cell lines were compared to 
B5T cells and to RLE phi 13 cells. The results are shown in Figure 5B. As for the 
transformed rat cell line, TRAG protein levels can be seen to be greatly elevated in these 
cells relative to RLE, and at comparable levels to B5T. This data strengthens the 
association between high TRAG levels and a transformed phenotype. 

To determine TRAG mRNA level in these cells, a Northern blot was conducted 
as described above. The results are shown in Figure 5B. 

In another study, protein extract and mRNA extract from the following human 
tumor cell lines were evaluated: Alex, Chang, FOCUS, HepG2, Huh-7, Sk-Hep-1, 
WRL-68, and HeLa. To determine the level of TRAG protein, a Western blot as 
described above was conducted. The results are shown in Figure 5C. Figure 5C shows a 
significant increase in TRAG protein in HepG2, which is an aggressive, transformed cell 
line. 

To determine TRAG mRNA level in these cells, a Northern blot was conducted 
as described above. The results are shown in Figure 5C. 

In yet another study, protein extract and mRNA extract from fresh, primary 
tumors taken from live double transgenic TGF-a/c-wyc mice were evaluated and 
compared to the corresponding levels in RLE phi 13 and B5T cells. Figure 5D shows 
that liver tumors from double transgenic TGF-a/c-myc mice have elevated levels of 
TRAG protein relative to RLE phi 13. Approximately 75% of the primary liver tumors 
(T) isolated from TGF-<x/c~myc mice showed higher levels of TRAG protein than did 
normal tissue (N) (Figure 5D). 

EXAMPLE 6 

The Determination of Cellular Localization of the TRAG Protein 
The cellular localization of the TRAG protein was determined by confocal 
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microscopy. To make this determination, RLE phi 13 and B5T cells weretransfected 
with empty (i.e. no TRAG insert) green fluorescent protein (GFP) vector (Clontech, Palo 
Alto, CA). These cells acted as the control. Other RLE phi 13 and B5T cells were 
transfected with the N-terminal GFP protein vector also containing the TRAG gene. This 
5 N-terminal fusion protein vector was constructed by inserting an EcoRI/Sall fragment 
encoding full-length TRAG into the pEGFP-N2 vector (Clontech, Palo Alto, CA) 
containing the green fluorescent protein (GFP). Transfection was carried out using 
LipofectAMDME PLUS reagent (Gibco, BRL Life Technologies, Rockville, MD), 
according to manufacturer's instructions. Microscopy conditions were: 20X zoom and 
10 3% laser power (for B5T) or 10% laser power for RLE phi 13. The results are shown in 
Figures 6A and 6B. 

Cells transfected with empty (pEGFP-N2) vector displayed GFP signal over the 
entire cell (Figures 6A, panels under "GFP-only"). But cells having the TRAG-GFP 
fusion protein vector showed only cytoplasmic fluorescence (Figure 6A, panels under 

15 "TRAG-GFP"). 

The cellular localization of the TRAG protein was confirmed by 
immiinohistochemistry. Immunohistochemistry was conducted according to the standard 
procedures described in E. Harlow and D. Lane, "Using Antibodies: A Laboratory 
Manual/ 1 Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY (1999). In 

20 particular, the TRAG protein in paraformaldehyde-fixed RLE phi 13 and B5T cells was 
probed using anti-TRAG polyclonal antibody (1 : 1 ,000 dilution). The polyclonal 
antibody was prepared according to Example 4. The results are shown in Figure 6B. 
Figure 6B shows that the TRAG protein localizes to the cytoplasm, but not to the 
nucleus. In fact, strongest staining is seen around the nucleus, within the cytoplasm (i.e. 

25 perinuclear). 

These results were compared to staining for pre-immune (PI) serum as a negative 
control. PI serum describes the serum removed from the rabbit before it is immunized. 
That is, before the rabbit was challenged with the TRAG polypeptide and induced to 
make the anti-TRAG antibody. Figure 6B shows significantly lighter staining for pre- 
30 immune serum than for RLE phi 13 and B5T cells. 
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EXAMPLE 7 
Preparation of Antibodies that Bind TRAG 
Polyclonal antibodies to TRAG were generated commercially by Genemed 
Synthesis, Inc. using art-accepted methods. Initially, the rabbits were immunized with a 
5 polypeptide corresponding to amino acids 760-781 of the rat TRAG sequence (Table 1). 
Subsequent Western Blot experiments (Example 4) showed that the anti-TRAG antibody 
cross-reacted well with both mouse and human TRAG. 

The following example illustrates preparation of monoclonal antibodies that can 
specifically bind TRAG. 

10 Techniques for producing the monoclonal antibodies are known in the art and are 

described, for instance, in Goding, supra . Immunogens that may be employed include 
purified TRAG, fusion proteins containing TRAG, and cells expressing recombinant 
TRAG on the cell surface. Selection of the immunogen can be made by the skilled 
artisan without undue experimentation. 

15 Mice, such as Balb/c, are immunized with the TRAG immunogen emulsified in 

complete Freund's adjuvant and injected subcutaneously or intraperitoneally in an 
amount from 1-1 00 micrograms. Alternatively, the immunogen is emulsified in MPL- 
TDM adjuvant (Ribi Immunochemical Research, Hamilton, MT) and injected into the 
animal's hind foot pads. The immunized mice are then boosted 10 to 12 days later with 

20 additional immunogen emulsified in the selected adjuvant. Thereafter, for several weeks, 
the mice may also be boosted with additional immunization injections. Serum samples 
may be periodically obtained from the mice by retro-orbital bleeding for testing in 
ELISA assays to detect TRAG antibodies. 

After a suitable antibody titer has been detected, the animals "positive" for 

25 antibodies can be injected with a final intravenous injection of TRAG. Three to four 

days later, the mice are sacrificed and the spleen cells are harvested. The spleen cells are 
then fused (using 35% polyethylene glycol) to a selected murine myeloma cell line such 
as P3X63AgU.l, available from ATCC, No. CRL 1597. The fusions generate hybridoma 
cells which can then be plated in 96 well tissue culture plates containing HAT 

30 (hypoxanthine, aminopterin, and thymidine) medium to inhibit proliferation of non- 
fused cells, myeloma hybrids, and spleen cell hybrids. 

The hybridoma cells will be screened in an ELISA for reactivity against TRAG. 
Determination of "positive" hybridoma cells secreting the desired monoclonal antibodies 
against TRAG is within the skill in the art. 
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The positive hybridoma cells can be injected intraperitoneally into syngeneic 
Balb/c mice to produce ascites containing the anti-TRAG monoclonal antibodies. 
Alternatively, the hybridoma cells can be grown in tissue culture flasks or roller bottles. 
Purification of the monoclonal antibodies produced in the ascites can be accomplished 
5 using ammonium sulfate precipitation, followed by gel exclusion chromatography. 
Alternatively, affinity chromatography based upon binding of antibody to protein A or 
protein G can be employed. 

EXAMPLE 8 

10 Cloning of mouse TRAG gene in pcDNA3.1 + 

The foil length mouse TRAG gene was cloned into pcDNA3. 1 + vector in three 
parts: a 5' EcoRI/Cla I fragment, a central Clal/Xhol, and a 3 1 Xhol/Xbal fragment. In 
all cases, PCR-amplified fragments were cloned into pGEM®-T Easy vectors (Promega, 
Madison, WI) according to manufacturer's instructions and using E. coli XL-2 Blue cells 

1 5 supercompetent cells or methylation-deficienct SCS 110 cells (in case of the methylation 
sensitive restriction enzymes Xbal or Clal; both cell types available from Strategene, La 
Jolla, CA). Desired regions from this PCR fragment were then cut out of the vectors 
using the abovementioned site-specific restriction enzymes. These fragments were 
joined in consecutive steps by subcloning into a pGEM®-T Easy vector. Finally, the 

20 complete gene fragment was subcloned into pcDNA3. 1 + , under the CMV promoter. This 
provides a construct ready for transfection into any cell type for the purposes of 
determining the effect of overexpression of TRAG, for example, in RLE phi 13 cells, 
which normally have low levels of this protein (as demonstrated in Examples 1 and 4). 

25 EXAMPLE 9 

Generation of TRAG Transgenic and Knockout Mice 
To elucidate more folly the potential role of TRAG in development and in 
carcinogenesis, the breeding of a knockout mouse is carried out. Part of the TRAG gene 
is deleted from a mouse line by standard knockout techniques, resulting in a truncated or 
30 nonexistent protein product (as described above in section D.3). If the complete removal 
of the TRAG gene product produces an embryonic lethal, a conditional knockout could 
be considered. This involves engineering the TRAG knockout construct in such a way 
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that the deletion of the critical region of the TRAG gene occurs only after 
embryogenesis. In this way, the role of TRAG in an adult animal can be examined. 

To determine the role of TRAG in an aggressive phenotype in tumor cell lines, a 
transgenic mouse model can be studied. To study a transgenic mouse model, the TRAG 

5 gene is inserted into the mouse genome under the control of a strong, constitutively 
active promoter element (which causes large amounts of the protein to be made at all 
times) or a conditionally active promoter element (which would induce large amounts of 
TRAG only on a specific stimulus). (See, for example, C Jhappen et al., Cell, 61:1137 
(1990)). TRAG overexpression is evaluated by Northern blot analysis, as described in 

10 Example 1 . In particular, the transgenic mice are studied to determine if TRAG 
overexpression shows more rapid or enhanced tumor formation or more aggressive 
metastasis. 



TABLES 

15 Table 1 shows the cDNA (SEQ ID NO: 1) and amino acid (SEQ ID NO: 2) 

sequences of rat TRAG from liver as determined by double strand sequence analysis of 
PCR products amplified from RLE (rat liver epithelium) cell and Fisher F344 rat brain 
tissue cDNA. The WD repeat elements are indicated by the underline, and the two 
tyrosine phosphorylation motifs are indicated in boldface type. 

20 10 20 30 40 50 

GAAGCTTTGACAGGTTTGAAAAACACAATGGCAGGAAACAGTCTAGTTCTG CCCATT 

MetAlaGlyAsnSerLeuValLe uProIle 



60 70 80 90 100 110 

25 GTTCTCTGGGGCCGCAAAGCACCCACACACTGTATTTCGTCAATACTGCTGACAGAT 
ValLeuTrpGlyArqLysAlaProThrHisCysIleSerSerlleLeuLeuThrAsp 

120 130 140 150 160 170 

GATGGGGGCACGATTGTGACTGGATGCCACGATGGACAGATATGTCTCTGGGAT CTC 
30 AspGlyGlyThrlleValThrGlyCysHisAspGlyGlnlleCysLeuTrpAsp Leu 

180 190 200 210 220 

TCAGAAGAGCTGGAAGTTAAT CCCCGAGCACTATTATTTGGTCACACAGCAGCCATC 
SerGluGluLeuGluValAsn ProArgAlaLeuLeuPheGlyHisThrAlaAlalle 

35 

230 240 250 260 270 280 

ACTTGTTTGTCAAAAGCCTGTGCTTCTGGAGACAAACAGTACACCGTGAGTGCGTCC 
ThrCysLeuSerLysAlaCysAlaSerGlyAspLysGlnTyrThrValSerAlaSer 

40 290 300 310 320 330 340 

GCAAATGGAGAGATGTGCCTCTGGGAT GTGAATGACGGCAGATGTATTGAATTTACA 
AlaAsnGlyGluMetCysLeuTrpAspV alAsnAspGlyArqCysIleGluPheThr 

350 360 370 380 390 
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AAATTAGCCTGCACACATACTGGCATACAGTTCTACCAGTTCTCTGTTGGAAATCAG 
LysLeuAlaCysThrHisThrGlylleGlnPheTyrGlnPheSerValGlyAsnGln 

400 410 420 430 440 450 

5 - CGAGAGGGAAGGCTTCTGTGCCATGGGCATTACCCTGAAATCCTCGTTGTGGATGCC 

ArgGluGlyArgLeuLeuCysHisGlyHisTyrProGluIleLeuValValAspAla 

460 470 480 490 500 510 

ACCAGCCTTGAGGTGTTGTATTCCTTGGTATCGAAGATCTCTCCAGACTGGATCAGC 
10 ThrSerLeuGluValLeuTyrSerLeuValSerLysIleSerProAspTrpIleSer 

520 530 540 550 560 570 

TCCATGAGYATYATCCGCTCTCACCGGACACAAGAGGACACTGTGGTAGCGCTGTCC 
SerMetSerllelleArgSerHisArgThrGlnGluAspThrValValAlaLeuSer 

15 

580 590 600 610 620 

GTGACAGGTATTCTGAAGGTGTGGATTGTGACCTCTGAAATTAGTGGATTGCAGGAC 
ValThrGlylleLeuLysValTrplleValThrSerGluIleSerGlyLeuGlnAsp 

20 630 640 650 660 670 680 

AC T GAG C C AAT AT T T GAG GAG G AAT C C AAACC AAT T T AT T GT C AG AAT T G C C AAAG C 
ThrGluProIlePheGluGluGluSerLysProIleTyrCysGlnAsnCysGlnSer 

690 700 710 720 730 740 

25 CTCTCTTTTTGTGCATTCACACAGAGGTCGCTCTTGGTTGTATGCTCCAAGTACTGG 
LeuSerPheCysAlaPheThrGlnArgSerLeuLeuValValCysSerLysTyrTrp 

750 760' 770 780 790 

AGGGTGTTCGATGCTGGCGACTACTCCCTGCTGTGTTCAGGTCCTAGTGAAGATGGA 
30 ArgValPheAspAlaGlyAspTyrSerLeuLeuCysSerGlyProSerGluAspGly 

800 810 820 830 840 850 

CAGACATGGACTGGAGGGGACTTTGTGTCTGCAGACAAAGTCATTATTTGGACTGAA 
GlnThrTrpThrGlyGlyAspPheValSerAlaAspLysValllelleTrpThrGlu 
35 , 

860 870 880 890 900 910 

AACGGGCAGAGTTACATTTACAAACTCCCTGCCAGTTGCCTTCCAGCTAGTGATTCA 
AsnGlyGlnSerTyrlleTyrLysLeuProAlaSerCysLeuProAlaSerAspSer 

40 920 930 940 950 960 

TTCCGCAGTGACGTGGGGAAAGCAGTTGAAAATCTGATTCCTCCCGTGCAGCATAGC 
PheArgSerAspValGlyLysAlaValGluAsnLeuIleProProValGlnHisSer 

970 980 990 1000 1010 1020 

45 CTCTTGGATCAGAAGGATAGAGAGTTGGTAATTTGTCCTCCTGTTACTCGGTTTTTC 
LeuLeuAspGlnLysAspArgGluLeuVallleCysProProValThrArgPhePhe 

1030 1040 1050 1060 1070 1080 

TATGGATGCAAGGAATATTTGCATAAGCTACTAATTCAGGGTGATTCTTCTGGAAGG 
50 TyrGlyCysLysGluTyrLeuHisLysLeuLeuIleGlnGlyAspSerSerGlyArg 

1090 1100 1110 1120 1130 1140 

T TAAG TAT TTGGAACATAGCAGACATAGCAGACAAAC AGGAAGCCAAT G AAGGG CTA 
LeuSerlleTrpAsnlleAlaAspIleAlaAspLysGlnGluAlaAsnGluGlyLeu 

55 

1150 1160 1170 1180 1190 

AAAACGACAACTTGTATTAGTTTGCAAGATGCATTTGACAAACTGAAGCCTTGTCCT 
LysThrThrThrCysIleSerLeuGlnAspAlaPheAspLysLeuLysProCysPro 

60 1200 1210 1220 1230 1240 1250 

GCTGGAATTATCGATCAGCTGAGTGTGATTCCAAACAGCAACGAACCACTTAAAGTA 
AlaGlyllelleAspGlnLeuSerVallleProAsnSerAsnGluProLeuLysVal 
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1260 1270 1280 1290 1300 1310 

ACTGCGAGTGTCTACATACCAGCACACGGGCGCCTTGTTTGCGGCCGGGAAGACGGA 
ThrAlaSerValTyrlleProAlaHisGlyArgLeuValCysGlyArgGluAspGly 

5 1320 1330 1340 1350 1360 

AGCATCATTATCGTCCCTGCCACCCAGACGGCCATAGTTCAGCTGCTGCAGGGAGAA 
SerllellelleValProAlaThrGlnThrAlalleValGlnLeuLeuGlnGlyGlu 

1370 1380 1390 1400 1410 1420 

1 0 CACATGCTCAGACGAGGTTGGCCCCCGCACAG AACCCTCCGTGGCCACCGGAACAAA 
HisMetLeuArgArgGlyTrpProProHisArg ThrLeiiArgGlyHisArqAsnLys 

1430 1440 1450 1460 1470 1480 

GTCACGTGTTTGCTGTATCCTCATCAGGTCTCAGCTCGGTATGATCAAAGATACCTG 
15 ValThrCysLeuLeuTyrProHisGlnValSerAlaArgTyrAspGlnArgTyrLeu 

1490 1500 1510 1520 1530 

ATATCCGGAGGTGTGGATTTTTCCGTCATCATTTGGGACA TTTTTTCCGGAGAAATG 
IleSerGlyGlyValAspPheSerValllelleTrpAsp IlePheSerGlyGluMet 

1540 1550 1560 1570 1580 1590 

AAACATATCTTCTGTGTTCATGGTGGTGAGATCACACAACTTCTGGTCCCGCCAGAA 
LysHisIlePheCysValHisGlyGlyGlulleThrGlnLeuLeuValProProGlu 

25 1600 1610 1620 1630 1640 1650 

AACTGTAGTGCAAGAGTTCAACACTGCGTCTGTTCTGTGGCCAGTGACCACTCTGTA 
AsnCysSerAlaArgValGlnHisCysValCysSerValAlaSerAspHisSerVal 

1660 1670 1680 1690 1700 1710 

30 GGGCTGCTAAGTCTGCG AGAGAAAAAATGCATCATGCTGGCGTCTCGTCACCTGTTC 
GlyLeuLeuSerLeiiArg GluLysLysCysIleMetLeuAlaSerArgHisLeuPhe 

1720 1730 1740 1750 1760 

CCTATTCAGGTGATCAAGTGGAGGCCTTCTGACGACTACCTGGTGGTGGGGTGCACG 
35 ProIleGlnVallleLysTrpArgProSerAspAspTyrLeuValValGlyCysThr 

1770 1780 1790 1800 1810 1820 

GACGGCTCTGTGTGTGTCTGGCAGA TGGACACTGGTGCGCTGGACCGCTGTGCAATG 
AspGlySerValCysValTrpGlnM etAspThrGlyAlaLeuAspArqCysAlaMet 

1830 1840 1850 1860 1870 1880 

GGGATAACAGCCGTGGAGATTCTCAATGCTTGTGACGAAGCTGTCCCTGCAGCAGTG 
GlylleThrAlaValGluIleLeuAsnAlaCysAspGluAlaValProAlaAlaVal 

45 1890 1900 1910 1920 1930 

GACTCACTCAGTCACCCAGCAGTCAACCTGAAGCAAGCCATGACACGGCGGAGTCTC 
AspSerLeuSerHisProAlaValAsnLeuLysGlnAlaMetThrArgArgSerLeu 

1940 1950 1960 1970 1980 1990 

50 GCCGCCCTTAAAAACATGGCCCACCACAAGCTGCAAACCCTTGCAACTAACCTTTTG 
AlaAlaLeuLysAsnMetAlaHisHisLysLeuGlnThrLeuAlaThrAsnLeuLeu 

2000 2010 2020 2030 2040 2050 

GCTTCTGAGGCCTCTGACAAGGGGAATTTACCTAAATATTCTCATAACTCCCTGATG 
55 AlaSerGluAlaSerAspLysGlyAsnLeuProLysTyrSerHisAsnSerLeuMet 

2060 2070 2080 2090 2100 . 

GTTCAAGCAATAAAGACAAACCTAACTGACCCGGATATCCATGTGCTTTTCTTTGAT 
ValGlnAlalleLysThrAsnLeuThrAspProAspIleHisValLeuPhePheAsp 



40 



60 



2110 2120 2130 2140 2150 2160 

GTGGAAGCTTTGATTATTCAACTCCTGACTGAAGAAGCCTCTAGGCCGAATACTGCA 
ValGluAlaLeuIlelleGlnLeuLeuThrGluGluAlaSerArgProAsnThrAla 
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2170 2180 2190 2200 2210 2220 

CTTATTTCCCCAGAGAATCTGCAGAAAGCATCTGGCAGTTCAGACAAAGGGGGCTCT 

LeuIleSerProGluAsnLeuGlnLysAlaSerGlySerSerAspLysGlyGlySer 

5 

2230 2240 2250 2260 2270 2280 

TTCCTGACTGGAAAACGAGCGGCAGTTCTTTTCCAGCAAGTGAAAGAAACTATCAAA 
PheLeuThrGlyLysArgAlaAlaValLeuPheGlnGlnValLysGluThrlleLys 

10 2290 2300 2310 2320 2330 

GAG AAC AT AAAGGAG CACC TCC T T GAT G AGGAGGAGGACGAGGAAGAGGTG AT G AGG 
GluAsnlleLysGluHisLeuLeuAspGluGluGluAspGluGluGluValMetArg 

2340 2350 2360 2370 2380 2390 

1 5 CAGAGGAGGGAAGAAAGTGACCCTGAGTACCGGGCCAGCAAGTCCAAACCACTT ACC 

GlnArgArgGluGluSerAspProGluTyrArgAlaSerLysSerLysProLeuThr 

2400 2410 2420 2430 2440 2450 

CTACTAGAATACAACCTTACTATGGATACCGCAAAATTATTCATGTCCTGTCTCCAC 
20 LeuLeuGluTyrAsnLeuThrMetAspThrAlaLysLeuPheMetSerCysLeuHis 

2460 2470 2480 2490 2500 

GCCTGGGGTTTGAATGAAGTTCTGGATGAAGTTTGCCTCGATCGCCTCGGCATGCTG 
AlaTrpGlyLeuAsnGluValLeuAspGluValCysLeuAspArgLeuGlyMetLeu 

25 

2510 2520 2530 2540 2550 2560 

AAACCACACTGCACAGTGTCCTTTGGTCTCCTATCCAGAGGAGGTCATATGTCCTTG 
LysProHisCysThrValSerPheGlyLeuLeuSerArgGlyGlyHisMetSerLeu 

30 2570 2580 2590 2600 2610 2620 

ATGCTTCCTGGTTATAATCAGGCTGCTGGAAAGCTACTGCAGGCGAAAGCAGAAGCA 
MetLeuProGlyTyrAsnGlnAlaAlaGlyLysLeuLeuGlnAlaLysAlaGluAla 

2630 2640 2650 2660 2670 

35 GGACGGAAGGGGCCAGCAACGGAGAGCGTAGGCAAGGGGACCTACACAGTGTCCCGA 
GlyArgLysGlyProAlaThrGluSerValGlyLysGlyThrTyrThrValSerArg 

2680 2690 2700 2710 2720 2730 

GCGGTCACCACGCAACATCTGTTGTCCATCATATCTCTGGCAAATACTTTAATGAGC 
40 AlaValThrThrGlnHisLeuLeuSerllelleSerLeuAlaAsnThrLeuMetSer 

2740 2750 2760 2770 2780 2790 

ATGACCAATGCGACGTTCATTGGAGATCACATGAAGAAGGGCCCCACCAGGCCGCCT 

MetThrAsnAlaThrPhelleGlyAspHisMetLysLysGlyProThrArgProPro 

45 

2800 2810 2820 2830 2840 2850 

AGACCAGGCACCCCAGACCTTTCTAAGGCAAGGGATTCCCCCCCAGCCTCCAGTAAC 
ArgProGlyThrProAspLeuSerLysAlaArgAspSerProProAlaSerSerAsn 

50 2860 2870 2880 2890 2900 

ATTGTGCAAGGACAGATTAAACAAGCTGCTGCGCCTGTCTCTGCTCGGTCTGCCGCC 
IleValGlnGlyGlnlleLysGlnAlaAlaAlaProValSerAlaArgSerAlaAla 

2910 2920 2930 2940 2950 2960 

55 GACCACTCTGGCTCTGCCTCTGCCTCTCCTGCTTTACGTACCTGCTTCTTAGTGAAT 
AspHisSerGlySerAlaSerAlaSerProAlaLeuArgThrCysPheLeuValAsn 

2970 2980 2990 3000 3010 3020 

GAAGGATGGAGCCAACTAGCTGCCATGCACTGTGTCATGCTGCCGGACCTGCTGGGG 
60 GluGlyTrpSerGlnLeuAlaAlaMetHisCysValMetLeuProAspLeuLeuGly 

3030 3040 3050 3060 3070 

CTGGGTAAATTCAGGCCTCCTCTTCTGGAGATGCTAGCTCGAAGATGGCAAGATCGG 
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LeuGlyLysPheArgProProLeuLeuGluMetLeuAlaArgArgTrpGlnAspArg 

3080 3090 3100 3110 3120 3130 

TGCTTGGAGGTGAGAGAGGCTGCACAGGCCCTTCTTCTAGCAGAGCTGAGAAGAATT 
CysLeuGluValArgGluAlaAlaGlnAlaLeuLeuLexiAlaGluLeuArgArglle 

3140 3150 3160 3170 3180 3190 

GAGCAGGCAGGACGGAAGGAGACTATTGATACCTGGGCTCCTTACTTACCTCAGTAC 
GluGlnAlaGlyArgLysGluThrlleAspThrTrpAlaProTyrLeuProGlnTyr 

3200 3210 3220 3230 3240 

ATGGACCATGTCATATCACCTGGAGTCACGGCGGAAGCCATGCAGACTATGGCAGCT 
MetAspHisVallleSerProGlyValThrAlaGluAlaMetGlnThrMetAlaAla 

15 3250 3260 3270 3280 3290 3300 

GCTCCAGATGCCTCGGGGCCAGAAGCCAAAGTCCAGGAAGAAGAGCATGACCTCGTG 
AlaProAspAlaSerGlyProGluAlaLysValGlnGluGluGluHisAspLeuVal 

3310 3320 3330 3340 3350 3360 

20 GACGATGACATCACCACTGGTTGCTTATCAAGTGTCCCACAAATGAAAAAGATGTCC 
AspAspAspIleThrThrGlyCysLeuSerSerValProGlnMetLysLysMetSer 

3370 3380 3390 3400 3410 3420 

ACATCTTACGAAGAAAGAAGGAAGCAGGCCACTGCTATTGTTCTCCTGGGAGTGATA 
25 ThrSerTyrGluGluArgArgLysGlnAlaThrAlalleValLeuLeuGlyVallle 

3430 3440 3450 3460 3470 

GGAGCAGAGTTTGGAGCTGAAATTGAACCACCAAAACTGCTGACCAGACCTCGGAGC 
GlyAlaGluPheGlyAlaGluIleGluProProLysLeuLeuThrArgProArgSer 

30 

3480 3490 3500 3510 3520 3530 

TCTAGTCAAATTCCTGAAGGATTTGGTTTGACAAGTGGAGGTTCCAACTACTCTCTG 
SerSerGlnlleProGluGlyPheGlyLeuThrSerGlyGlySerAsnTyrSerLeu 

35 3540 3550 3560 3570 3580 3590 

GCCAGACATACGTGCAAGGCACTGACATTTCTTCTGCTACAGCCACCAAGTCCCAAA 
AlaArgHisThrCysLysAlaLeuThrPheLeuLeuLeuGlnProProSerProLys 

3600 3610 3620 3630 3640 

40 CTTCCTCCTCATAGCACCATCCGGAGAACTGCCATTGACCTGATCGGGCGAGGGTTC 
LeuProProHisSerThrlleArgArgThrAlalleAspLeuIleGlyArgGlyPhe 

3650 3660 3670 3680 3690 3700 

ACCGTGTGGGAGCCTTACATGGACGTGTCTGCTGTCCTGATGGGGCTGCTGGAGCTG 
45 ThrValTrpGluProTyrMetAspValSerAlaValLeuMetGlyLeuLeuGluLeu 

3710 3720 3730 3740 3750 3760 

TGTGCAGATGCTGAGAAGCAGCTGGCCAACATCACAATGGGGCTGCCTCTGAGCCCT 
CysAlaAspAlaGluLysGlnLeuAlaAsnlleThrMetGlyLeuProLeuSerPro 

50 

3770 3780 3790 3800 3810 

GCAGCTGACTCTGCCCGATCCGCAAGACACGCCCTTTCTCTCATAGCCACCGCCAGA 
AlaAlaAspSerAlaArgSerAlaArgHisAlaLeuSerLeuIleAlaThrAlaArg 

55 3820 3830 3840 3850 3860 3870 

CCACCCGCCTTCATCACCACCATAGCTAAGGAGGTGCACAGACACACGGCCCTTGCA 
ProProAlaPhelleThrThrlleAlaLysGluValHisArgHisThrAlaLeuAla 

3880 3890 3900 3910 3920 3930 

60 GCAAAT ACCCAG TCCCAGCAGAGT AT CCACACCACCACACTG GCAAGGGCTAAAGGC 

AlaAsnThrGlnSerGlnGlnSerlleHisThrThrThrLeuAlaArgAlaLysGly 

3940 3950 3960 3970 3980 3990 
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GAAATCCTGAGAGTCATTGAAATTCTTATCGAAAAGATGCCTACGGATGTTGTGGAT 
GluIleLeuArgVallleGluIleLeuIleGluLysMetProThrAspValValAsp 

4000 4010 4020 4030 4040 

CTTCTTGTGGAGGTCATGGACATCATCATGTACTGCCTGGAAGGATCTTTAGTTAAG 
LeuLeuValGluValMetAspIlelleMetTyrCysLeuGluGlySerLeuValLys 

4050 4060 4070 4080 4090 4100 

AAGAAGGGTCTTCAGGAGTGTTTCCCAGCCATCTGCAGGTTCTACATGGTCAGCTAT 
LysLysGlyLeuGlnGluCysPheProAlalleCysArgPheTyrMetValSerTyr 

4110 4120 4130 4140 4150 4160 

TATGAGCGGAGTCACAGAATTGCAGTTGGAGCACGCCATGGCTCAGTGGCCCTGTAT 
TyrGluArgSerHisArglleAlaValGlyAlaArgHisGlySerValAlaLeuTyr 

4170 4180 4190 4200 4210 

GACATCCGGACTGGGAAATGTCAGACAATCCACGGACACAAGGGACCTATCACTGCA 
AspIleArgThrGlyLysCysGlnThrlleHisGlyHisLysGlyProIleThrAla 

4220 4230 4240 4250 4260 4270 

GTGTCCTTTGCTCCTGATGGGCGTTACCTTGCCACCTACTCAAACACTGACAGCCAC 
ValSerPheAlaProAspGlyArgTyrLeuAlaThrTyrSerAsnThrAspSerHis 

4280 4290 4300 4310 4320 4330 

ATTTCTTTCTGGCAGATGT^CACCTCACTTCTGGGAAGCATTGGCATGCTGAACTCA 
IleSerPheTrpGlnMetAsnThrSerLeuLeuGlySerIleGlyMetLe\iAsnSer 

4340 4350 4360 4370 4380 

GCACCTCAGCTGCGCTGCATCAAGACCTACCAGGTACCTCCAGTGCAGCCCGCATCC 
AlaProGlnLeuArgCysIleLysThrTyrGlnValProProValGlnProAlaSer 

4390 4400 4410 4420 4430 4440 

CCTGGCTCGCACAACGCCCTCAGGTTGGCCCGGCTCATCTGGACTTCCAACCGGAAT 
ProGlySerHisAsnAlaLeuArgLeuAlaArgLeuIleTrpThrSerAsnArgAsn 

4450 4460 4470 4480 4490 

GTTATCCTCATGGCCCACGATGGGAAGGAGCACCGCTTCATGGTCTGA 
VallleLeuMetAlaHisAspGlyLysGluHisArgPheMetValSTP 

40 

Table 2 shows the cDNA (SEQ ID NO: 3) and amino acid (SEQ ID NO: 4) 
sequences of human TRAG from brain as determined by double strand sequence of PCR 
products amplified from human liver tissue cDNA. The WD repeat elements are 
indicated by the underline, and the two tyrosine phosphorylation motifs are indicated in 
45 boldface type. 

10 20 30 40 50 

T AACAT GT T T TCG AGAGAAAT ATT GTAAT AT CTGACAAT T T T T ATAACAT T T T C AG G 

60 70 80 90 100 110 

50 TTTGAAAACACAAACACAATGGCAGGAAACAGCCTTGTTCT ACCCATTGTTCTTTGG 

MetAlaGlyAsnSerLeuValLeu ProIleValLeuTrp 

120 130 140 150 160 170 

GGTCGAAAAGCGCCCACACATTGCATCTCAGCCGTACTTTTAACAGATGATGGGGCC 
55 GlyArqLysAlaProThrHisCysIleSerAlaValLeuLeuThrAspAspGlyAla 

180 190 200 210 220 

ACG ATCGT AACAGGATG T CACGACGGACAAAT AT GT CT C T GGGAT CTT T C AGT AGAA 
ThrlleValThrGlyCysHisAspGlyGlnlleCysLeuTrpAsp LeuSerValGlu 



10 



15 



20 



25 



56 



WO 01/66739 



5 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



PCT/US01/04475 



230 240 250 260 270 280 

CTGCAAGTTAAT CCTCGAGCACTGTTGTTTGGTCATACAGCATCAATCACTTGTTTG 
LeuGlnValAsn ProArqAlaLeuLeuPheGlyHisThrAlaSerlleThrCysLeu 

290 300 310 320 330 340 

TCTAAAGCTTGTGCTTCCAGTGACAAACAGTATATTGTGAGTGCATCTGAAAGTGGA 
SerLysAlaCysAlaSerSerAspLysGlnTyrlleValSerAlaSerGluSerGly 

350 360 370 380 390 

GAGATGTGCCTCTGGGAT GTGAGTGATGGCAGATGTATTGAATTTACAAAATTAGCT 
GluMetCysLeuTrpAspV alSerAspGlyArqCysIleGluPheThrLysLeuAla 

400 410 420 430 440 450 

TGCACACATACTGGCATACAGTTCTACCAGTTCTCTGTTGGGAATCAGCGAGAAGGA 
CysThrHisThrGlylleGlnPheTyrGlnPheSerValGlyAsnGlnArgGluGly 

460 470 480 490 500 510 

AGGCTTTTATGCCACGGACATTACCCTGAAATCCTTGTTGTGGATGCTACCAGCCTT 
ArgLeuLeuCysHisGlyHisTyrProGluIleLeuValValAspAlaThrSerLeu 

520 530 540 550 560 570 

GAAGTATTATACTCCTTAGTATCAAAGATATCACCAGACTGGATTAGCTCCATGAGT 
GluValLeuTyrSerLeuValSerLysIleSerProAspTrpIleSerSerMetSer 

580 590 600 610 620 

ATTATTCGATCCCACCGAACACAAGAGGACACAGTGGTAGCACTCTCGGTGACTGGC 
IlelleArgSerHisArgThrGlnGluAspThrValValAlaLeuSerValThrGly 

630 640 650 660 670 680 

ATCCTGAAGGTCTGGATTGTTACCTCGGAAATAAGTGACATGCAGGATACTGAGCCA 

IleLeuLysValTrpIleValThrSerGluIleSerAspMetGlnAspThrGluPro 

690 700 710 720 730 740 

AT AT T TGAG GAGGAAT CCAAACCAAT T TAT T GT CAGAAT T G CCAAAGC AT C T CT T T T 
IlePheGluGluGluSerLysProIleTyrCysGlnAsnCysGlnSerlleSerPhe 

750 760 770 780 790 

TGTGCATTTACACAAAGGTCACTTTTGGTTGTGTGTTCCAAATATTGGAGGGTGTTC 
CysAlaPheThrGlnArgSerLeuLeuValValCysSerLysTyrTrpArgValPhe 

800 810 820 830 840 850 

GATGCCGGAGACTATTCCTTGTTGTGTTCAGGTCCTAGTGAAAATGGACAGACATGG 
AspAlaGlyAspTyrSerLeuLeuCysSerGlyProSerGluAsnGlyGlnThrTrp 

860 870 880 890 900 910 

ACCGGGGGGGACTTTGTCTCATCAGATAAAGTCATCATTTGGACAGAAAATGGGCAA 
ThrGlyGlyAspPheValSerSerAspLysValllelleTrpThrGluAsnGlyGln 

920 930 940 950 960 

AGTTATATTTACAAACTACCTGCCAGTTGCCTTCCAGCTAGTGATTCATTCCGCAGT 
SerTyrlleTyrLysLeuProAlaSerCysLeuProAlaSerAspSerPheArgSer 

970 980 990 1000 1010 1020 

GATGTGGGGAAGGCAGTTGAAAATTTAATTCCTCCTGTACAACATATCCTCTTGGAT 
AspValGlyLysAlaValGluAsnLeuIleProProValGlnHisIleLeuLeuAsp 

1030 1040 1050 1060 1070 1080 

CGAAAAGATAAAGAGTTGCTAATTTGTCCTCCTGTTACTCGGTTCTTCTATGGATGC 

ArgLysAspLysGluLeuLeuIleCysProProValThrArgPhePheTyrGlyCys 

1090 1100 1110 1120 1130 1140 

AGAGAATATTTCCATAAACTGTTAATTCAGGGTGATTCTTCTGGAAGGTTGAATATT 
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ArgGluTyrPheHisLysLeuLeuIleGlnGlyAspSerSerGlyArgLeuAsnlle 

1150 1160 1170 1180 1190 

TGGAACATATCAGACACAGCTGATAAACAGGGAAGTGAAGAAGGGCTGGCAATGACA 
TrpAsnlleSerAspThrAlaAspLysGlnGlySerGluGluGlyLeuAlaMetThr 

1200 1210 1220 1230 1240 1250 

ACTTCTATTAGTTTGCAAGAGGCATTTGATAAACTGAATCCTTGTCCTGCTGGAATT 
ThrSerlleSerLeuGlnGluAlaPheAspLysLeuAsnProCysProAlaGlylle 

1260 1270 1280 1290 1300 1310 

ATAGATCAGCTGAGTGTGATTCCCAATAGTAATGAACCTCTTAAAGTAACTGCAAGT 
IleAspGlnLeuSerVallleProAsnSerAsnGluProLeuLysValThrAlaSer 

15 1320 1330 1340 1350 1360 

GTGTACATACCAGCACATGGACGACTTGTTTGTGGTCGTGAAGATGGAAGCATAGTT 
ValTyrlleProAlaHisGlyArgLeuValCysGlyArgGluAspGlySerlleVal 

1370 1380 1390 1400 1410 1420 

20 ATTGTACCTGCCACACAGACGGCCATAGTACAGCTGTTGCAAGGGGAACACATGCTC 
IleValProAlaThrGlnThrAlalleValGlnLeuLeuGlnGlyGluHisMetLeu 

1430 1440 1450 1460 1470 1480 

AGAAGAGGTTGGCCACCTCACAG AACACTCCGTGGTCATCGGAACAAAGTCACATGT 
25 ArgArgGlyTrpProProHisAr gThrLeuArqGlyHisArgAsnLysValThrCys 

1490 1500 1510 1520 1530 

TTGCTATATCCTCATCAGGTCTCAGCTCGGTATGATCAAAGATACCTGATATCTGGA 
LeuLeuTyrProHisGlnValSerAlaArgTyrAspGlnArgTyrLeuIleSerGly 
30 ~~ ~" ' ^ "~" — ■- * 

1540 1550 1560 1570 1580 1590 

GGTGTGGATTTTTCAGTCATAATTTGGGACA TATTTTCTGGAGAAATGAAACATATC 
GlyValAspPheSerValllelleTrpAsp IlePheSerGlyGluMetLysHisIle 

35 1600 1610 1620 1630 1640 1650 

TTCTGTGTTCATGGTGGTGAGATTACTCAACTTCTAGTTCCACCTGAAAACTGTAGT 
PheCysValHisGlyGlyGluIleThrGlnLeuLeuValProProGluAsnCysSer 

1660 1670 1680 1690 1700 1710 

40 GCAAGAGTACAGCACTGCATCTGCTCTGTAGCCAGTGACCACTCAGTAGGACTTCTA 
AlaArgValGlnHisCysIleCysSerValAlaSerAspHisSerValGlyLeuLeu 

1720 1730 1740 1750 1760 

AGTTTGCG AGAGAAAAAATGCATAATGTTGGCATCTCGTCACCTTTTTCCTATTCAA 
45 SerLeuAr gGluLysLysCysIleMetLeuAlaSerArgHisLeuPheProIleGln 

1770 1780 1790 1800 1810 1820 

GTAATCAAATGGAGGCCTTCTGATGATTACCTGGTGGTGGGGTGTTCAGATGGTTCT 

vmi a ^^^ a upMp^^iv«iai TC Y.»«^p6iy».» 

1830 1840 1850 1860 1870 1880 

GTGTACGTCTGGCAAA TGGATACTGGTGCATTGGATCGTTGTGTGATGGGGATAACA 
ValTyrValTrpGlnM etAspThrGlyAlaLeiiAspArgCysValMetGlvIleThr 

55 1890 1900 1910 ' 1920 1930 

GCAGTTGAGATTCTAAACGCTTGTGATGAAGCTGTTCCTGCTGCTGTTGATTCACTT 
AlaValGluIleLeuAsnAlaCysAspGluAlaValProAlaAlaValAspSerLeu 

1940 1950 1960 1970 1980 1990 

60 AGTCATCCAGCAGTCAACCTAAAACAAGCTATGACGAGACGTAGTCTTGCTGCTCTT 
SerHisProAlaValAsnLeuLysGlnAlaMetThrArgArgSerLeuAlaAlaLeu 

2000 2010 2020 2030 2040 2050 
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AAAAATATGGCCCATCATAAGCTACAAACCCTTGCAACTAACCTCTTGGCTTCTGAG 
LysAsnMetAlaHisHisLysLeuGlnThrLeuAlaThrAsnLeuLeuAlaSerGlu 

2060 2070 2080 2090 2100 

5 GCATCTGACAAGGGAAATTTACCTAAATATTCTCATAACTCCCTGATGGTTCAAGCA 
AlaSerAspLysGlyAsnLeuProLysTyrSerHisAsnSerLeuMetValGlnAla 

2110 2120 2130 2140 2150 2160 

AT AAAGAC AAACCTAACAGACCCGGAC ATACATGTGCT AT T C T T TGAT G TGGAAGCG 
10 IleLysThrAsnLeuThrAspProAspIleHisValLeuPhePheAspValGluAla 

2170 2180 2190 2200 2210 2220 

TTGATTATTCAACTCCTGACTGAAGAAGCCTCTAGGCCGAATACTGCTCTTATTTCC 

LeuIlelleGlnLeuLeuThrGluGluAlaSerArgProAsnThrAlaLeuIleSer 

15 

2230 2240 2250 2260 2270 2280 

CCAGAGAATTTGCAAAAAGCATCTGGCAGTTCAGACAAAGGGGGCTCTTTTTTAACT 
ProGluAsnLeuGlnLysAlaSerGlySerSerAspLysGlyGlySerPheLeuThr 

20 2290 2300 2310 2320 2330 

GGAAAACG AGCAGCAGTT CT CT TCCAACAAGT GAAAGAAACGAT CAAAG AGAAC AT C 
GlyLysArgAlaAlaValLeuPheGlnGlnValLysGluThrlleLysGluAsnlle 

2340 2350 2360 2370 2380 2390 

25 AAGG AACAC C T CCT T GAT GAT G AAGAGGAGG AT GAG G AG AT AAT GAG G C AG AG AAG6 

LysGluHisLeuLeuAspAspGluGluGluAspGluGluIleMetArgGlnArgArg 

2400 2410 2420 2430 2440 2450 

GAAGAAAGTGATCCTGAATATCGGTCCAGCAAATCAAAGCCATTGACCCTATTAGAA 
30 GluGluSerAspProGluTyrArgSerSerLysSerLysProLeuThrLeuLeuGlu 

2460 2470 2480 2490 2500 

TATAATTTAACTATGGACACTGCAAAGCTGTTTATGTCCTGCCTTCACGCCTGGGGT 
TyrAsnLeuThrMetAspThrAlaLysLeuPheMetSerCysLeuHisAlaTrpGly 

35 

2510 2520 2530 2540 2550 2560 

TTGAATGAAGTACTGGATGAAGTTTGCCTGGATCGCCTTGGAATGCTGAAACCCCAC 
LeuAsnGluValLeuAspGluValCysLeuAspArgLeuGlyMetLeuLysProHis 

40 2570 2580 2590 2600 2610 2620 

TGCACCGTATCGTTTGGCCTCTTGTCAAGAGGAGGCCATATGTCACTGATGCTGCCG 
CysThrValSerPheGlyLeuLeuSerArgGlyGlyHisMetSerLeuMetLeuPro 

2630 2640 2650 2660 2670 

45 GGTTATAATCAGCCTGCTTGTAAACTGTCACATGGGAAAACAGAAGTAGGAAGGAAG 
GlyTyrAsnGlnProAlaCysLysLeuSerHisGlyLysThrGluValGlyArgLys 

2680 2690 2700 2710 2720 2730 

CTGCCAGCGTCTGAGGGAGTAGGAAAGGGAACTTACGGAGTGTCCCGTGCCGTCACC 
50 LeuProAlaSerGluGlyValGlyLysGlyThrTyrGlyValSerArgAlaValThr 

2740 2750 2760 2770 2780 2790 

ACACAGCATCTCCTGTCTATCATTTCTTTGGCAAATACTTTAATGAGTATGACCAAT 

ThrGlnHisLeuLeuSerllelleSerLeuAlaAsnThrLeuMetSerMetThrAsn 

55 

2800 2810 2820 2830 2840 2850 

GCAAC T TT TATTGG TGATCATATGAAGAAGGG TCCTACCAGGCCACCT AG ACC AAGC 
AlaThrPhelleGlyAspHisMetLysLysGlyProThrArgProProArgProSer 

60 2860 2870 2880 2890 2900 

ACCCCAGACCTTTCTAAGGCAAGGGGTTCCCCTCCAACTTCCAGTAATATTGTGCAA 
ThrProAspLeuSerLysAlaArgGlySerProProThrSerSerAsnlleValGln 



59 



WO 01/66739 



PCT/US01/04475 



2910 2920 2930 2940 2950 2960 

GGACAGATTAAACAAGTTGCTGCACCTGTCGTTTCCGCTCGGTCTGATGCTGATCAC 
GlyGlnlleLysGlnValAlaAlaProValValSerAlaArgSerAspAlaAspHis 

5 2970 2980 2990 3000 3010 30,20 

TCTGGCTCTGACCCTCCTTCTGCTCCTGCTTTACATACCTGTTTCTTAGTAAATGAA 
SerGlySerAspProProSerAlaProAlaLeuHisThrCysPheLeuValAsnGlu 

3030 3040 3050 3060 3070 

10 GGTTGGAGTCAGTTAGCTGCTATGCACTGTGTTATGCTGCCAGACCTACTGGGATTG 
GlyTrpSerGlnLeuAlaAlaMetHisCysValMetLeuProAspLeuLeuGlyLeu 

3080 3090 3100 3110 3120 3130 

GATAAATTTAGGCCTCCCCTTCTGGAGATGCTGGCCCGAAGATGGCAAGATCGATGC 
15 AspLysPheArgProProLeuLeuGluMetLeuAlaArgArgTrpGlnAspArgCys 

3140 3150 3160 3170 3180 3190 

TTGGAGGTGAGAGAAGCCGCACAGGCCCTGCTTCTGGCGGAACTGAGAAGAATTGAG 
LeuGluValArgGluAlaAlaGlnAlaLeuLeuLeuAlaGluLeuArgArglleGlu 

20 

3200 3210 3220 3230 3240 

CAGGCAGGCAGGAAGGAAGCCATTGATGCCTGGGCTCCTTACTTACCTCAGTACATA 
GlnAlaGlyArgLysGluAlalleAspAlaTrpAlaProTyrLeuProGlnTyrlle 

25 3250 3260 3270 3280 3290 3300 

GACCACGTCATATCACCTGGAGTCACATCAGAAGCCGCGCAGACTATCACCACGGCT 
AspHisVallleSerProGlyValThrSerGluAlaAlaGlnThrlleThrThrAla 

3310 3320 3330 3340 3350 3360 

30 CCTGATGCCTCAGGGCCTGAAGCAAAAGTCCAGGAGGAAGAGCATGACCTTGTTGAC 
ProAspAlaSerGlyProGluAlaLysValGlnGluGluGluHisAspLeuValAsp 

3370 3380 3390 3400 3410 3420 

GATGACATCACCACTGGTTGCTTATCAAGTGTCCCACAAATGAAAAAAATTTCTACA 
35 AspAspIleThrThrGlyCysLeuSerSerValProGlnMetLysLysIleSerThr 

3430 3440 3450 3460 3470 

TCTTACGAGGAAAGACGGAAGCAAGCTACCGCTATTGTTTTACTTGGAGTAATAGGA 
SerTyrGluGluArgArgLysGlnAlaThrAlalleValLeuLeuGlyVallleGly 

40 

3480 3490 3500 3510 3520 3530 

GCTGAATTTGGTGCTGAAATTGAACCTCCTAAACTATTGACCAGACCTCGAAGCTCT 
AlaGluPheGlyAlaGluIleGluProProLysLeuLeuThrArgProArgSerSer 

45 3540 3550 3560 3570 3580 3590 

AGCCAAATTCCTGAGGGATTCGGGTTGACTAGTGGTGGATCCAACTACTCGCTGGCC 
SerGlnlleProGluGlyPheGlyLeuThrSerGlyGlySerAsnTyrSerLeuAla 

3600 3610 3620 3630 3640 

50 AGACATACTTGCAAGGCACTGACGTTTCTTCTGCTACAGCCTCCAAGCCCCAAACTT 
ArgHisThrCysLysAlaLeuThrPheLeuLeuLeuGlnProProSerProLysLeu 

3650 3660 3670 3680 3690 3700 

CCTCCACACAGCACTATCCGAAGAACAGCCATTGATCTGATTGGACGTGGGTTCACT 
55 ProProHisSerThrlleArgArgThrAlalleAspLeuIleGlyArgGlyPheThr 

3710 3720 3730 3740 3750 3760 

GTTTGGGAGCCTTACATGGATGTGTCCGCTGTTCTGATGGGGCTTCTCGAACTTTGT 
ValTrpGluProTyrMetAspValSerAlaValLeuMetGlyLeuLeuGluLeuCys 



60 



3770 3780 3790 3800 3810 

GCCGATGCCGAGAAACAACTTGCCAACATCACAATGGGGTTGCCTCTGAGCCCAGCA 
AlaAspAlaGluLysGlnLeuAlaAsnlleThrMetGlyLeuProLeuSerProAla 
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3820 3830 3840 3850 3860 3870 

GCTGACTCGGCCCGCTCTGCGAGGCATGCCCTCTCGCTCATTGCCACCGCCAGACCA 
AlaAspSerAlaArgSerAlaArgHisAlaLeuSerLeuIleAlaThrAlaArgPro 

3880 3890 3900 3910 3920 3930 

CCCGCCTTCATCACCACCATAGCCAAAGAGGTACACAGACATACGGCTCTTGCAGCA 

ProAlaPhelleThrThrlleAlaLysGluValHisArgHisThrAlaLeuAlaAla 

3940 3950 3960 3970 3980 3990 

AATAC C CAATCACAGCAGAATAT GCACACAACAAC TC T T GCACGAGC TAAAGGGGAA 
AsnThrGlnSerGlnGlnAsnMetHisThrThrThrLeuAlaArgAlaLysGlyGlu 

4000 4010 4020 4030 4040 

ATTTTGAGAGTCATTGAAATTCTTATTGAAAAGATGCCCACAGATGTTGTGGATCTT 
IleLeuArgVallleGluIleLeuIleGluLysMetProThrAspValValAspLeu 

4050 4060 4070 4080 4090 4100 

CTCGTGGAGGTTATGGACATCATTATGTACTGCCTTGAAGGATCTTTAGTTAAAAAG 
LeuValGluValMetAspIlelleMetTyrCysLeuGluGlySerLeuValLysLys 

4110 4120 -4130 4140 4150 4160 

AAAGGTCTTCAAGAATGTTTCCCAGCCATCTGCAGGTTCTACATGGTCAGCTATTAT 
LysGlyLeuGlnGluCysPheProAlalleCysArgPheTyrMetValSerTyrTyr 

4170 4180 4190 4200 4210 

GAGCGGAATCACAGAATAGCAGTTGGAGCTCGCCATGGTTCAGTGGCCCTGTACGAC 
GluArgAsnHisArglleAlaValGlyAlaArgHisGlySerValAlaLeuTyrAsp 

4220 4230 4240 4250 4260 4270 

ATCCGGACTGGAAAATGTCAGACAATCCATGGACACAAGGGACCAATCACTGCAGTG 
IleArgThrGlyLysCysGlnThrlleHisGlyHisLysGlyProIleThrAlaVal 

4280 4290 4300 4310 4320 4330 

GCTTTTGCTCCTGATGGAAGATATCTTGCCACCTACTCAAACACTGACAGCCACATT 
AlaPheAlaProAspGlyArgTyrLeuAlaThrTyrSerAsnThrAspSerHisIle 

4340 4350 4360 4370 4380 

TCTTTTTGGCAGATGAACACGTCACTGCTGGGAAGCATCGGCA,TGCTGAACTCGGCA 
SerPheTrpGlnMetAsnThrSerLeuLeuGlySerlleGlyMetLeuAsnSerAla 

4390 4400 4410 4420 4430 4440 

CCTCAGCTGCGCTGCATTAAAACCTACCAGGTGCCCCCTGTGCAGCCCGCGTCCCCC 
ProGlnLeuArgCysIleLysThrTyrGlnValProProValGlnProAlaSerPro 

4450 4460 4470 4480 4490 4500 

GGCTCCCACAATGCCCTCAAGCTGGCCCGGCTCATCTGGACTTCCAACCGCAACGTC 

GlySerHisAsnAlaLeuLysLeuAlaArgLeuIleTrpThrSerAsnArgAsnVal 

4510 4520 4530 4540 

ATCCTCATGGCCCATGACGGGAAGGAGCACCGCTTCATGGTCTAA 
IleLeuMetAlaHisAspGlyLysGluHisArgPheMetValSTP 



Table 3 shows the cDNA (SEQ ID NO: 5) and amino acid (SEQ ID NO: 6) 
sequences of mouse TRAG from brain as determined by double strand sequence analysis 
of PCR products amplified from mouse liver and brain tissue cDNA. The WD repeat 
elements are indicated by the underline, and the two tyrosine phosphorylation motifs are 
indicated in boldface type. 
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10 20 30 40 50 

TGAAGATTTGACAGGTTTGAAAACGCCATGGCAGGAAACAGCCTAGTTCTGCCCATT 

MetAlaGlyAsnSerLeuValLeu ProIle 

5 60 70 80 90 100 110 

GTTCTTTGGGGCCGCAAAGCACCCACACATTGCATTTCGTCAATACTGTTGAGAGAT 
ValLeuTrpGlyArqLysAlaProThrHisCysIleSerSerlleLeuLeuThrAsp 

120 130 140 150 160 170 

10 GATGGGGGCACAATTGTAACTGGATGCCACGATGGACAAATATGTCTCTGGGAT GTT 
AspGlyGlyThrlleValThrGlyCysHisAspGlyGlnlleCysLeuTrpAspV al 

180 190 200 210 220 

TCGGTAGAACTAGAAGTTAAT CCCCGAGCACTGTTATTTGGCCACACAGCATCCATC 
15 SerValGluLeuGluValAsn ProArqAlaLeuLeuPheGlyHisThrAlaSerlle 

230 240 250 260 270 280 

ACTTGTTTGTCAAAAGCCTGCGCTTCTGGGGACAAGCGGTACACTGTGAGCGCGTCT 
ThrCysLeuSerLysAlaCysAlaSerGlyAspLysArqTyrThrValSerAlaSer 

290 300 310 320 330 340 

GCAAACGGAGAGATGTGCCTCTGGGAT GTGAACGATGGCAGATGTATTGAATTTACC 
AlaAsnGlyGluMetCysLeuTrpAsp ValAsnAspGlyArgCysIleGluPheThr 

25 350 360 370 380 390 

AAGTTAGCCTGCACACACACTGGCATACAGTTCTACCAGTTCTCTGTTGGGAACCAG 
LysLeuAlaCysThrHisThrGlylleGlnPheTyrGlnPheSerValGlyAsnGln 

400 410 420 430 440 450 

30 CAAGAGGGCAGGCTCCTCTGCCATGGACATTACCCTGAAATCCTCGTTGTGGATGCC 
GlnGluGlyArgLeuLeuCysHisGlyHisTyrProGluIleLeuValValAspAla 

460 470 480 490 500 510 

ACCAGCCTTGAGGTGTTGTATTCCTTGGTATCGAAGATCTCTCCAGACTGGATCAGC 
35 ThrSerLeuGluValLeuTyrSerLeuValSerLysIleSerProAspTrpIleSer 

520 530 540 550 560 570 

TCCATGAGCATCATCCACTCTCAGCGGACACAAGAGGACACTGTGGTGGCGCTCTCT 
SerMetSerllelleHisSerGlnArgThrGlnGluAspThrValValAlaLeuSer 

40 

580 590 600 610 620 

GTGACAGGTATTCTGAAGGTGTGGATTGTCACCTCTGAAATGAGTGGAATGCAGGAT 
ValThrGlylleLeuLysValTrpIleValThrSerGluMetSerGlyMetGlnAsp 

45 630 640 650 660 670 680 

ACTGAGCCAATATTTGAGGAGGAATCCAAACCAATTTATTGTCAGAATTGCCAAAGC 
ThrGluProIlePheGluGluGluSerLysProIleTyrCysGlnAsnCysGlnSer 

690 700 710 720 730 740 

50 ATCTCTTTTTGTGCATTCACACAGAGGTCACTTTTGGTCGTATGCTCCAAATACTGG 
IleSerPheCysAlaPheThrGlnArgSerLeuLeuValValCysSerLysTyrTrp 

750 760 770 780 790 

AGGGTGTTTGATGCTGGCGACTACTCTCTGTTGTGCTCAGGTCCTAGTGAAAATGGA 
55 ArgValPheAspAlaGlyAspTyrSerLeuLeuCysSerGlyProSerGluAsnGly 

800 810 820 830 840 850 

CAG ACAT GG ACT G GAGGGGACT T TGT GT C T GCAGACAAAG T CATCATCT GGACAGAA 
GlnThrTrpThrGlyGlyAspPheValSerAlaAspLysValllelleTrpThrGlu 
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860 870 880 890 900 910 

AATGGGCAGAGTTACATCTACAAACTCCCTGCCAGTTGCCTTCCAGCTAGTGATTCA 
AsnGlyGlnSerTyrlleTyrLysLeuProAlaSerCysLeuProAlaSerAspSer 
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920 930 940 950 960 

TTCCGCAGCGACGTGGGGAAAGCAGTGGAAAATTTGATCCCTCCTGTGCAGCATAGC 
PheArgSerAspValGlyLysAlaValGluAsnLeuIleProProValGlnHisSer 

5 

970 980 990 1000 1010 1020 

CTCTTGGATCAGAAAGATAAAGAGTTGGTAATTTGTCCTCCTGTTACTCGGTTTTTC 
LeuLeuAspGlnLysAspLysGluLeuVallleCysProProValThrArgPhePhe 

10 1030 1040 1050 1060 1070 1080 

TACGGATGCAAGGAATATTTGCATAAGCTACTAATTCAGGGCGATTCTTCTGGAAGG 
TyrGlyCysLysGluTyrLeuHisLysLeuLeuIleGlnGlyAspSerSerGlyArg 

1090 1100 1110 1120 1130 1140 

1 5 TTAAATATCTGGAACATAGCAGACATAGCAGAGAAACAGGAAGCCGATGAAGGGCTA 
LeuAsnlleTrpAsnlleAlaAspIleAlaGluLysGlnGluAlaAspGluGlyLeu 

1150 1160 1170 1180 1190 

AAGATGACAACTTGTATTAGCTTGCAAGAGGCATTTGACAAGCTGAAGCCTTGCCCT 
20 LysMetThrThrCysIleSerLeuGlnGluAlaPheAspLysLeuLysProCysPro 

1200 1210 1220 1230 1240 1250 

GCTGGAATTATCGATCAGCTAAGTGTGATTCCCAACAGCAATGAACCTCTTAAAGTA 
AlaGlyllelleAspGlnLeuSerVallleProAsnSerAsnGluProLeuLysVal 

25 

1260 1270 1280 1290 1300 1310 

ACTGCGAGTGTCTACATACCAGCACACGGGCGACTTGTTTGTGGCCGGGAAGATGGA 
ThrAlaSerValTyrlleProAlaHisGlyArgLeuValCysGlyArgGluAspGly 

30 1320 1330 1340 1350 1360 

AGCATCATTATTGTGCCTGCCACCCAGACGGCCATAGTTCAGCTATTGCAAGGAGAA 
SerllellelleValProAlaThrGlnThrAlalleValGlnLeuLeuGlnGlyGlu 

1370 1380 1390 1400 1410 1420 

35 CACATGCTCAGAAGAGGTTGGCCCCCTCACAG AACCCTCCGTGGCCATCGGAACAAA 
HisMetLeuArgArgGlyTrpProProHisArg ThrLeuArgGlyHisArgAsnLys 

1430 1440 1450 1460 1470 1480 

GTCACGTGCCTGCTGTATCCCCATCAGGTCTCAGCTCGGTATGACCAAAGATACCTG 
40 ValThrCysLeuLeuTyrProHisGlnValSerAlaArgTyrAspGlnArgTyrLeu 

1490 1500 1510 1520 1530 

ATATCTGGAGGTGTGGATTTTTCCGTCATAATTTGGGACA TATTTTCTGGAGAAATG 
IleSerGlyGlyValAspPheSerValllelleTrpAsp IlePheSerGlyGluMet 

1540 1550 1560 1570 1580 1590 

AAACATATCTTCTGTGTTCATGGTGGTGAGATCACACAACTTCTGGTCCCCCCGGAA 
LysHisIlePheCysValHisGlyGlyGluIleThrGlnLeuLeuValProProGlu 

50 1600 1610 1620 1630 1640 1650 

AACTGTAGTGCAAGAGTTCAACACTGCATCTGTTCCGTAGCCAGTGACCACTCTGTA 
AsnCysSerAlaArgValGlnHisCysIleCysSerValAlaSerAspHisSerVal 

1660 1670 1680 1690 1700 1710 

55 GGCCTCCTAAGTCTGCG AGAGAAAAAATGCATCATGTTGGCGTCTCGCCACCTCTTT 
GlyLeuLeuSerLeuArg GluLysLysCysIleMetLeuAlaSerArgHisLeuPhe 

1720 1730 1740 1750 1760 

CCTATTCAGGTGATCAAGTGGAGGCCTTCCGACGACTACCTGGTGGTGGGATGCACA 
60 ProIleGlnVallleLysTrpArqProSerAspAspTyrLeuValValGlyCysThr 

1770 1780 1790 1800 1810 1820 

GACGGCTCTGTGTATGTCTGGCAGATGGACACTGGTGCGCTGGATCGTTGTGCAATG 
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AspGlySerValTyrValTrpGlnM etAspThrGlyAlaLeuAspArqCvsAlaMet 

1830 1840 1850 1860 1870 1880 

GGAATCACAGCAGTGGAGATACTGAACGCTTGTGACGAAGCCGTTCCTGCCGCTGTA 
GlylleThrAlaValGluIleLeuAsnAlaCysAspGluAlaValProAlaAlaVal 

1890 1900 1910 1920 1930 

GACTCTCTTAGTCATCCAGCAGTCAACCTGAAACAAGCCATGACAAGACGGAGTCTC 
AspSerLeuSerHisProAlaValAsnLeuLysGlnAlaMetThrArgArgSerLeu 

1940 1950 1960 1970 1980 1990 

GCTGCCCTTAAAAATATGGCCCACCATAAGCTGCAAACCCTTGCAACGAACCTTTTG 
AlaAlaLeuLysAsnMetAlaHisHisLysLeuGlnThrLeuAlaThrAsnLeuLeu 

2000 2010 2020 2030 2040 2050 

GCTTCTGAGGCCTCTGACT^AGGGTAATTTACCTAAATATTCCCATAACTCCCTGATG 
AlaSerGluAlaSerAspLysGlyAsnLeuProLysTyrSerHisAsnSerLeuMet 

2060 2070 2080 2090 2100 

GTTCAAGCAATAAAGACAAACC7AACAGACCCGGATATCCATGTGCTTTTCTTTGAT 
ValGlnAlalleLysThrAsnLeuThrAspProAspIleHisValLeuPhePheAsp 

2110 2120 2130 2140 2150 2160 

GTGGAAGCTTTGATTATTCAACTCCTGACTGAAGAAGCCTCTAGGCCGAATACTGCA 
ValGluAlaLeuIlelleGlnLeuLeuThrGluGluAlaSerArgProAsnThrAla 

2170 2180 2190 2200 2210 2220 

CTTATTTCCCCAGAGAATCTGCAGAAAGCATCTGGCAGTTCAGACAAAGGGGGCTCT 

LeuIleSerProGluAsnLeuGlnLysAlaSerGlySerSerAspLysGlyGlySer 

2230 2240 2250 2260 2270 2280 

TTCCTAACTGGGAAGCGAGCAGCGGTTCTTTTCCAGCAAGTGAAAGAAACTATCAAA 
PheLeuThrGlyLysArgAlaAlaValLeuPheGlnGlnValLysGluThrlleLys 

2290 2300 2310 2320 2330 

GAGAACATAAAGGAACACCTTCTGGATGAGGAGGAGGACGAGGAGGAGGCGAGGAGG 
GluAsnlleLysGluHisLeuLeuAspGluGluGluAspGluGluGluAlaArgArg 

2340 2350 2360 2370 2380 2390 

CAGAGCAGGGAGGAT AGTGACCCTGAGTACC GGGCCAG C AAG T CCAAG C C AC T G ACC 
GlnSerArgGluAspSerAspProGluTyrArgAlaSerLysSerLysProLeuThr 

2400 2410 2420 2430 2440 2450 

CTACTAGAATACAACCTTACTATGGATACCGCAAAATTATTCATGTCCTGTCTCCAC 
LeuLeuGluTyrAsnLeuThrMetAspThrAlaLysLeuPheMetSerCysLeuHis 

2460 2470 2480 2490 2500 

GCCTGGGGCTTGAATGAAGTTCTGGATGAAGTTTGTCTCGATCGCCTCGGAATGCTG 
AlaTrpGlyLeuAsnGluValLeuAspGluValCysLeuAspArgLeuGlyMetLeu 

2510 2520 2530 2540 2550 2560 

AAACCGCACTGCACGGTGTCCTTTGGTCTCCTATCGAGAGGAGGTCATATGTCGCTG 
LysProHisCysThrValSerPheGlyLeuLeuSerArgGlyGlyHisMetSerLeu 

2570 2580 2590 2600 2610 2620 

ATGCTTCCCGGTTATAATCAGGCTGCTGGAAAACTGCTGCATGCGAAAGCAGAAGTA 
MetLeuProGlyTyrAsnGlnAlaAlaGlyLysLeuLeuHisAlaLysAlaGluVal 

2630 2640 2650 2660 2670 

GGAAGGAAGCTGCCAGCAGCGGAGGGCGTAGGCAAGGGAACCTACACAGTGTCTCGA 
GlyArgLysLeuProAlaAlaGluGlyValGlyLysGlyThrTyrThrValSerArg 

2680 2690 2700 2710 2720 2730 
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GCGGTCACCACTCAGCATCTGTTGTCCATCATATCCCTGGCGAATACTTTAATGAGT 
AlaValThrThrGlnHisLeuLeuSerllelleSerLeuAlaAsnThrLeuMetSer 

2740 2750 2760 2770 2780 2790 

5 ATGACCAATGCAACTTTCATTGGCGATCACATGAAGAAGGGCCCCACCAGGCCGCCT 
MetThrAsnAlaThrPhelleGlyAspHisMetLysLysGlyProThrArgProPro 

2800 2810 2820 2830 2840 2850 

AGACCTGGCACCCCAGACCTCTCTAAGGCGAGGGATTCCCCTCCAGCCTCCAGTAAC 
10 ArgProGlyThrProAspLeuSerLysAlaArgAspSerProProAlaSerSerAsn 

2860 2870 2880 2890 2900 

ATTGTGCAAGGACAGATTAAACAAGCCGCTGCGCCTGTCGTTTCTGCTCGGTCTGAC 
IleValGlnGlyGlnlleLysGlnAlaAlaAlaProValValSerAlaArgSerAsp 

15 

2910 2920 2930 2940 2950 2960 

GCTGATCACTCTGGCTCTGACTCTGCCTCTCCTGCTTTACCTACCTGTTTCTTAGTA 
AlaAspHisSerGlySerAspSerAlaSerProAlaLeuProThrCysPheLeuVal 

20 2970 2980 2990 3000 3010 3020 

AATGAAGGCTGGAGCCAGTTAGCCGCCATGCACTGTGTTATGTTGCCTGACCTGCTG 
AsnGluGlyTrpSerGlnLeuAlaAlaMetHisCysValMetLeuProAspLeuLeu 

3030 3040 3050 3060 3070 

25 GGGCTGGAGAGATTCAGGCCTCCTCTCCTGGAGATGCTAGCTCGAAGATGGCAGGAC 
GlyLeuGluArgPheArgProProLeuLeuGluMetLeuAlaArgArgTrpGlnAsp 

3080 3090 3100 3110 3120 3130 

CGATGCTTGGAGGTGAGAGAGGCTGCACAGGCCCTGCTTCTAGCTGAGCTGAGAAGG 
30 ArgCysLeuGluValArgGluAlaAlaGlnAlaLeuLeuLeiiAlaGluLeuArgArg 

3140 3150 3160 3170 3180 3190 

ATTGAGCAGGCAGGACGGAAGGAGACGATCGACACCTGGGCTCCTTACTTACCTCAG 
IleGluGlnAlaGlyArgLysGluThrlleAspThrTrpAlaProTyrLeuProGln 

35 

3200 3210 3220 3230 3240 

TATATGGACCATGTCATATCACCTGGCGTCACGGCGGAAGCCATGCAGACTATGGCA 
TyrMetAspHisVallleSerProGlyValThrAlaGluAlaMetGlnThrMetAla 

40 3250 3260 3270 3280 3290 3300 

GCTGCTCCAGATGCCTCGGGCCCAGAAGCCAAAGTCCAGGAAGAAGAGCATGACCTG 
AlaAlaProAspAlaSerGlyProGluAlaLysValGlnGluGluGluHisAspLeu 

3310 3320 3330 3340 3350 3360 

45 GTGGATGATGACATCACTGCAGGTTGCTTGTCAAGTGTCCCACAGATGAAAAAAATC 
ValAspAspAspIleThrAlaGlyCysLeuSerSerValProGlnMetLysLysIle 

3370 3380 3390 3400 3410 3420 

TCCACGTCTTACGAAGAAAGAAGGAAGCAGGCCACGGCTATTGTTCTCCTGGGAGTG 
50 SerThrSerTyrGluGluArgArgLysGlnAlaThrAlalleValLeuLeuGlyVal 

3430 3440 3450 3460 3470 

ATAGGGGCAGAGTTTGGAGCTGAAATTGAACCGCCGAAGCTGCTGACCAGGCCTCGA 
IleGlyAlaGluPheGlyAlaGluIleGluProProLysLeuLeuThrArgProArg 

55 

3480 3490 3500 3510 3520 3530 

AGCTCCAGTCAAATCCCTGAAGGCTTTGGCTTGACCAGCGGAGGCTCCAACTACTCC 
SerSerSerGlnlleProGluGlyPheGlyLeuThrSerGlyGlySerAsnTyrSer 

60 3540 3550 3560 3570 3580 3590 

CTGGCCAGACACACGTGCAAGGCACTGACATATCTTCTGCTACAGCCACCAAGTCCC 
LeiaAlaArgHisThrCysLysAlaLeuThrTyrLeuLeuLeuGlnProProSerPro 
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3600 3610 3620 3630 3640 

AAGCTCCCTCCTCATAGCACCATCCGGAGAACTGCCACTGACCTGATTGGGCGAGGG 
LysLeuProProHisSerThrlleArgArgThrAlaThrAspLeuIleGlyArgGly 

5 3650 3660 3670 3680 3690 3700 

TTCACTGTATGGGAGCCCTACATGGATGTGTCCGCTGTCCTGATGGGGCTGCTGGAG 
PheThrValTrpGluProTyrMetAspValSerAlaValLeuMetGlyLeuLeuGlu 

3710 3720 3730 3740 3750 3760 

10 CTCTGTGCAGACGCTGAGAAACAACTTGCCAACATCACAATGGGGCTGCCTCTGAGC 
LeuCysAlaAspAlaGluLysGlnLeuAlaAsnlleThrMetGlyLeuProLeuSer 

3770 3780 3790 3800 3810 

CCTGCAGCTGACTCTGCCCGCTCCGCAAGGCACGCCCTGTCTCTCATCGCCACAGCC 
15 ProAlaAlaAspSerAlaArgSerAlaArgHisAlaLeuSerLeuIleAlaThrAla 

3820 3830 3840 3850 3860 3870 

AGACCACCCGCCTTCATCACCACCATAGCTAAGGAGGTCCACAGACACACGGCCCTT 
ArgProProAlaPhelleThrThrlleAlaLysGluValHisArgHisThrAlaLeu 

20 

3880 3890 3900 3910 3920 3930 

GCAGCGAACACCCAGTCCCAGCAGAGTATACACACCACCACCCTGGCGAGGGCTAAA 

AlaAlaAsnThrGlnSerGlnGlnSerlleHisThrThrThrLexaAlaArgAlaLys 

25 3940 3950 3960 3970 3980 3990 

GGCGAAATCCTGAGAGTCATTGAAATTCTCATTGAAAAGATGCCCACGGATGTGGTG 
GlyGluIleLeuArgVallleGluIleLeuIleGluLysMetProThrAspValVal 

4000 4010 4020 4030 4040 

30 GATCTTCTTGTGGAGGTGATGGATATCATCATGTACTGCCTGGAAGGATCTTTAGTG 
AspLeuLeuValGluValMetAspIlelleMetTyrCysLeuGluGlySerLeuVal 

4050 4060 4070 4080 4090 4100 

AAAAAGAAGGGTCTTCAGGAGTGTTTTCCAGCTATCTGCAGGTTCTACATGGTCAGC 
35 LysLysLysGlyLeuGlnGluCysPheProAlalleCysArgPheTyrMetValSer 

4110 4120 4130 ■ 4140 4150 4160 

TATTATGAGCGCAGTCACAGAATCGCAGTTGGAGCACGCCATGGCTCAGTAGCCCTG 
TyrTyrGluArgSerHisArglleAlaValGlyAlaArgHisGlySerValAlaLeu 

40 

4170 4180 4190 4200 4210 

TAT GAC ATCCGGAC T GGAAAAT GTCAGACAAT CCATGGGC ACAAGG GACCAAT C ACG 
TyrAspIleArgThrGlyLysCysGlnThrlleHisGlyHisLysGlyProIleThr 

45 4220 4230 4240 4250 4260 4270 

GCGGTGTCCTTTGCTCCTGATGGCCGTTACCTTGCCACCTACTCGAACACCGACAGC 
AlaValSerPheAlaProAspGlyArgTyrLeuAlaThrTyrSerAsnThrAspSer 

4280 4290 4300 4310 4320 4330 

50 CACATTTCCTTCTGGCAGATGAACACCTCACTCCTGGGAAGCATTGGAATGCTGAAC 
HisIleSerPheTrpGlnMetAsnThrSerLeuLeuGlySerlleGlyMetLeuAsn 

4340 4350 4360 4370 4380 

TCAGCTCCTCAGCTGCGCTGCATCAAGACCTACCAGGTCCCTCCAGTGCAGCCAGCA 
55 SerAlaProGlnLeuArgCysIleLysThrTyrGlnValProProValGlnProAla 

4390 4400 4410 4420 4430 4440 

TCCCCTGGCTCCCACAACGCCCTTAAGTTGGCCCGGCTCATCTGGACTTCCAACCGG 
SerProGlySerHisAsnAlaLeuLysLeuAlaArgLeuIleTrpThrSerAsnArg 



60 



4450 4460 4470 4480 4490 

AATGTTATCCTCATGGCCCATGATGGGAAGGAACACCGCTTCATGGTCTGA 

AsnVallleLeuMetAlaHisAspGlyLysGluHisArgPheMetValSTP 
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Table 4 shows a comparison of the amino acid sequence of the rat brain (SEQ ID 
NO: 2), murine brain (SEQ ID NO: 6), and human brain (SEQ ID NO: 4) TRAG 
5 proteins. The sequence specific only to brain TRAG is at positions 95 1 to 98 1 in the rat 
(top), 951 to 980 in the mouse (middle), and 951 to 982 in the human (bottom). 

1 MAGNSLVLPIVLWGRKAPTHCISSILLTDDGGTIVTGCHDGQICLWDVSVELEVNPRALL 60 

1 MAGNSLVLPIVLWGRKAPTHCISSILLTDDGGTIVTGCHDGQICLWDLSEELEVNPRALL 60 

1 MAGNSLVLPIVLWGRKAPTHCISAVLLTDDGATIVTGCHDGQICLWDLSVELQVNPRALL 60 

10 

61 FGHTASITCLSKACASGDKRYTVSASANGEMCLWDVNDGRCIEFTKLACTHTGIQFYQFS 120 

61 FGHTAAITCLSKACASGDKQYTVSASANGEMCLWDVNDGRCIEFTKLACTHTGIQFYQFS 120 

61 FGHTASITCLSKACASSDKQYIVSASESGEMCLWDVSDGRCIEFTKLACTHTGIQFYQFS 120 

15 121 VGNQQEGRLLCHGHYPEILWDATSLEVLYSLVSKISPDWISSMSIIHSQRTQEDTVVAL 180 

121 VGNQREGRLLCHGH YPEILWDATSLEVL YSLVSKI S PDWI SSMS I IRSHRTQEDT WAL 180 

121 VGNQREGRLLCHGHYPEILWDATSLEVLYSLVSKI S PDWI S SMS I IRSHRTQEDT WAL 180 

181 SVTGILKVWIVTSEMSGMQDTEPIFEEESKPIYCQNCQSISFCAFTQRSLLVVCSKYWRV 240 

20 181 SVTGILKVWIVTSEISGLQDTEPIFEEESKPIYCQNCQSLSFCAFTQRSLLWCSKYWRV 240 

181 SVTGILKVWIVTSEISDMQDTEPIFEEESKPIYCQNCQSISFCAFTQRSLLWCSKYWRV 240 

241 FDAGDYSLLCSGPSENGQTWTGGDFVSADKVIIWTENGQSYIYKLPASCLPASDSFRSDV 300 

241 FDAGDYSLLCSGPSEDGQTWTGGDFVSADKVIIWTENGQSYIYKLPASCLPASDSFRSDV 300 

25 241 FDAGDYSLLCSGPSENGQTWTGGDFVSSDKVIIWTENGQSYIYKLPASCLPASDSFRSDV 300 

301 GKAVENLI PP VQH SLLDQKDKELVI CPPVTRFFYGCKEYLHKLLI QGDS SGRLNI WNI AD 360 

301 GKAVENLI PPVQHSLLDQKDRELVICPPVTRFFYGCKEYLHKLLIQGDSSGRLSIWNIAD 360 

301 GKAVENLI PPVQHILLDRKDKELLICPPVTRFFYGCREYFHKLLIQGDSSGRLNIWNISD 360 

30 

361 I AE KQE ADE GLKMT T C I S LQE AFDKLKP C PAG 1 1 DQLSVI PNSNEPLKVTASVYI PAHGR 420 

361 IADKQEANEGLKTTTCI SLQDAFDKLKPCPAGI I DQLSVI PNSNEPLKVTASVYI PAHGR 420 

361 TADKQGSEEGLAMTTSISLQEAFDKLNPCPAGI I DQLSVI PNSNEPLKVTASVYI PAHGR 420 

35 421 LVCGREDGSIIIVPATQTAIVQLLQGEHMLRRGWPPHRTLRGHRNKVTCLLYPHQVSARY 480 

421 LVCGREDGSI I IVPATQTAIVQLLQGEHMLRRGWPPHRTLRGHRNKVTCLLYPHQVSARY 480 

421 LVCGREDGSIVIVPATQTAIVQLLQGEHMLRRGWPPHRTLRGHRNKVTCLLYPHQVSARY 480 

481 DQRYLISGGVDFSVIIWDIFSGEMKHIFCVHGGEITQLLVPPENCSARVQHCICSVASDH 540 

40 481 DQRYLISGGVDFSVIIWDIFSGEMKHIFCVHGGEITQLLVPPENCSARVQHCVCSVASDH 540 

481 DQRYLISGGVDFSVIIWDIFSGEMKHIFCVHGGEITQLLVPPENCSARVQHCICSVASDH 540 

541 SVGLLSLREKKCIMLASRHLFPIQVIKWRPSDDYLWGCTDGSVYVWQMDTGALDRCAMG 600 

541 SVGLLSLREKKCIMLASRHLFPIQVIKWRPSDDYLWGCTDGSVCVWQMDTGALDRCAMG 600 

45 541 SVGLLSLREKKCIMLASRHLFPIQVIKWRPSDDYLWGCSDGSVYVWQMDTGALDRCVMG 600 

601 ITAVEILNACDEAVPAAVDSLSHPAVNLKQAMTRRSLAALKNMAHHKLQTLATNLLASEA 660 

601 ITAVEILNACDEAVPAAVDSLSHPAVNLKQAMTRRSLAALKNMAHHKLQTLATNLLASEA 660 

601 ITAVEILNACDEAVPAAVDSLSHPAVNLKQAMTRRSLAALKNMAHHKLQTLATNLLASEA 660 

50 

661 SDKGNLPKYSHNSLMVQAIKTNLTDPDIHVLFFDVEALIIQLLTEEASRPNTALISPENL 720 

661 SDKGNLPKYSHNSLMVQAIKTNLTDPDIHVLFFDVEALIIQLLTEEASRPNTALISPENL 720 

661 SDKGNLPKYSHNSLMVQAIKTNLTDPDIHVLFFDVEALI IQLLTEEASRPNTALI SPENL 720 

55 721 QKASGS S DKGGS FLTGKRAAVLFQQVKETI KENI KEHLLDEEEDEEEARRQSREDS DPE Y 780 

721 QKASGSSDKGGSFLTGKRAAVLFQQVKETIKENIKEHLLDEEEDEEEVMRQRREESDPEY 780 

721 QKASGSS DKGGS FLTGKRAAVLFQQVKETI KEN I KEHLLDDEEE DEE IMRQRREES DPE Y 780 

781 RASKSKPLTLLEYNLTMDTAKLFMSCLHAWGLNEVLDEVCLDRLGMLKPHCTVSFGLLSR 840 
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781 RASKSKPLTLLEYNLTMDTAKLFMSCLHAWGLNEVLDEVCLDRLGMLKPHCTVSFGLLSR 840 

781 RSSKSKPLTLLEYNLTMDTAKLFMSCLHAWGLNEVLDEVCLDRLGMLKPHCTVSFGLLSR 840 

841 GGHMSLMLPGYNQAAGKLLHAKAEVGRKLPAAEGVGKGTYTVSBIAVTTQHLLSIISLANT 900 

5 841 GGHMSLMLPGYNQAAGKLLQAKAEAGRKGPATESVGKGT YTVSRAVTTQHLLS 1 1 SLANT 900 

841 GGHMSLMLPGYNQPACKLSHGKTEVGRKLPASEGVGKGTYGVSRAVTTQHLLSIISLANT 900 

901 LMSMTNATFIGDHMKKGPTRPPRPGTPDLSKARDSPPASSNIVQGQIKQAAAPWSARSD 960 

901 LMSMTNATFIGDHMKKGPTRPPRPGTPDLSKARDSPPASSNIVQGQIKQAAAP-VSARSA 959 

10 901 LMSMTNATFIGDHMKKGPTRPPRPSTPDLSKARGSPPTSSNIVQGQIKQVAAPVVSARSD 960 

961 ADHSGSDS-ASPALPTCFLVNEGWSQLAAMHCVMLPDLLGLERFRPPLLEMLARRWQDRC 1019 

960 ADHSGSAS-ASPALRTCFLVNEGWSQLAAMHCVMLPDLLGLGKFRPPLLEMLARRWQDRC 1018 

961 ADHSGSDPPSAPALHTCFLVNEGWSQLAAMHCVMLPDLLGLDKFRPPLLEMLARRWQDRC 1020 

15 

1020 LEVREAAQALLLAELRRI EQAGRKET I DTWAP YLPQYMDHVI S PG VTAEAMQTMAAAPDA 1079 
1019 LEVREAAQALLLAELRRI EQAGRKET I DTWAP YLPQYMDHVI SPGVTAEAMQTMAAAPDA 1078 

1021 LEVREAAQALLLAELRRI EQAGRKEAI DAWAPYL PQYI DH VI SPG VTSEAAQTITTAPDA 1080 

20 1080 SGPEAKVQEEEHDLVDDDITAGCLSSVPQMKKISTSYEERRKQATAIVLLGVIGAEFGAE 1139 

1079 SGPEAKVQEEEHDLVDDDITTGCLSSVPQMKKMSTSYEERRKQATAIVLLGVIGAEFGAE 1138 

1081 SGPEAKVQEEEHDLVDDDITTGCLSSVPQMKKISTSYEERRKQATAIVLLGVIGAEFGAE 1140 

1140 IEPPKLLTRPRSSSQIPEGFGLTSGGSNYSLARHTCKALTYLLLQPPSPKLPPHSTIRRT 1199 
•25 1139 IEPPKLLTRPRSSSQIPEGFGLTSGGSNYSLARHTCKALTFLLLQPPSPKLPPHSTIRRT 1198 

1141 IEPPKLLTRPRSSSQIPEGFGLTSGGSNYSLARHTCKALTFLLLQPPSPKLPPHSTIRRT 1200 

1200 ATDLIGRGFTVWEPYMDVSAVLMGLLELCADAEKQLANITMGLPLSPAADSARSARHALS 1259 

1199 AIDLIGRGFTVWEPYMDVSAVLMGLLELCADAEKQLANITMGLPLSPAADSARSARHALS 1258 

30 1201 AIDLIGRGFTVWEPYMDVSAVLMGLLELCADAEKQLANITMGLPLSPAADSARSARHALS 1260 

1260 LIATARPPAFITTIAKEVHRHTALAANTQSQQSIHTTTLARAKGEILRVIEILIEKMPTD 1319 
1259 LIATARPPAFITTIAKEVHRHTALAANTQSQQSIHTTTLARAKGEILRVIEILIEKMPTD 1318 

1261 LIATARPPAFITTIAKEVHRHTALAANTQSQQNMHTTTLARAKGEILRVIEILIEKMPTD 1320 

35 

1320 WDLLVEVMDIIMYCLEGSLVKKKGLQECFPAICRFYMVSYYERSHRIAVGARHGSVALY 1379 
1319 VVDLLVEVMDI IMYCLEGSLVKKKGLQECFPAICRFYMVS YYERSHRIAVGARHGSVALY 1378 

1321 VVDLLVEVMDI IMYCLEGSLVKKKGLQECFPAICRFYMVS YYERNHRIAVGARHGSVALY 1380 

40 1380 DIRTGKCQTIHGHKGPITAVSFAPDGRYLATYSNTDSHISFWQMNTSLLGSIGMLNSAPQ 1439 

137 9 DIRTGKCQTIHGHKGPITAVSFAPDGRYLATYSNTDSHISFWQMNTSLLGSIGMLNSAPQ 1438 

1381 DIRTGKCQTIHGHKGPITAVAFAPDGRYLATYSNTDSHISFWQMNTSLLGSIGMLNSAPQ 1440 

1440 LRCIKTYQVPPVQPASPGSHNALKLARLIWTSNRNVILMAHDGKEHRFMV 1489 

45 1439 LRC I KT YQ VP P VQP AS PGS HNALRLARL I WT SNRNVI LMAH DGKEHRFMV 1488 

14 41 LRCIKTYQVPPVQPASPGSHNALKLARLIWTSNRNVILMAH DGKEHRFMV 1490 



Table 5 shows a comparison of the amino acid sequence of murine (SEQ ID NO: 
50 6) and Drosophila TRAG (SEQ ID NO:7) proteins. Drosophila protein sequence 
obtained by translating noncontiguous regions of Drosophila genomic sequence 
(GenBank Accession Number AL021086). 

1 MAGNSLVLPIVLWGRKAPTHCISSILLTDDGGTIVTGCHDGQICLWDVS-VELEVNPRAL 59 

1 MVSTNLWPWLWGPTAPTHCISSVFLSDDQFTLVTGCYDGQIXLWQVEPTTLKMSPRCL 60 

55 

60 LFGHTASITCLSKACASGDKRYTVSASANGEMCLWDVNDGRCIEFTKLACTHTGIQFYQF 119 

61 LVGHSAPVLCLVRASLLPENNFLVSSSENGEMCTWDLTDGKCMEAVKLPQVHTQIQSYHT 120 

120 SVGNQQEGRLLCHGHYPEILWDATSLEVLYSLVSKISPDWISSMSIIHSQRTQEDTWA 179 
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121 — ANSEDVRLFCIGYYAEIMVMDPFSLEVIYVLSSKVKPDWISAIHVLRPMRRKDDVVLA 178 

180 LSVTGILKVWIVTSEMSGMQDTEPIFEEESKPIYCQNCQSISFCAFTQRSLLWCSKYWR 239 
179 ITTTGTVKVWTLTGNEN — KHAEPIYENESKEIRCLNAITMNCCAQNQRTVLLVCTKYWQ 236 

240 VFDAGDYSLLCSGPSENGQTWTGGDFVSADKVIIWTENGQSYIYKLPASCLPASDSFRSD 299 
237 IYDAGDFTVLCSVIAPARERWQGGDFITSDRVMLWTDEGKGYLYKLPANCIPDNKEFHS- 295 

300 VGKAVENLIPPVQHSLLDQKDKELVICPPVTRFFYGCKEYLHKLLIQGDSSGRLNIWNIA 359 
296 — KSVVRDAPYLYYVLQHAGDK-VLSCPPAMKLLQGAGG-QHNLLR-GDSEGYISWNVP 350 

360 DIAE KQEADEGLKMTTCISLQEAFDKLKPCPAGIIDQLSVIPNSNEPLKVTA 411 

351 EVPLDNISILQAKQMPPRPLKPHVCTSLVEAWSIMDPPPVGILDQLSRITES — PVKLTS 408 

412 SVYIPAHGRLVCGREDGSIIIVPATQTAIVQLLQGEHMLRRGWPPHRTLRGHRNKVTCLL 471 
409 S I YLPQQSRLVI GREDGS I VI VPATQTVMMQLLVGI KQN FS DWPS HQI L YGHRGRVNCLL 4 68 

472 YPHQVSARYDQRYLI SGGVDFSVI IWDI FSGEMKHI FCVHGGEITQLLVPPENCSARVQH 531 
4 69 CPSMIHSRYEKSHLLSGGIDFAVCLWDLYSGSLLHRFCVHAGEITQLLVPPESCSPRILK 528 

532 CICSVASDHSVGLLSLREKKCIMLASRHLFPIQVIKWRPSDDYLVVGCTDGSVYVWQMDT 591 
529 CICSVASDHSVTLVSLQERKCVTLASRHLFPVVTIKWAPLDDFLIVGCSDGSVYVWQMET 588 

592 GAL DRCAMG I TA VE I LNAC DEAVP AA VDSLSH PAVNLKQAMTRRSL 637 

589 GHLDRVLHGMLAEEVLSACDEQAEDGGSGGGGGSNGASASEMGMANPAVHFFRGLKSRNM 64 8 

638 AALKNMAHHKLQTLATNLLASEASDKGN LPKYSHNSLMVQAIKTNLTDPDIHVLFFD 694 

64 9 NAIR HATQRGITQLQQLQGHNQGNFDFLMKHRSNPLVIQGLRTNPKDAESHILFFD 704 

695 VEALI IQLLTEEASRPNT ALI S PENLQKASGS SDKGGS FLTGKRAAVLFQQVK E 748 

705 IEGLIFELHSEEYAQMTPATLESLGVHLQNPKDGKSMHLDASKKIGDFFNKVKNKAVDVE 764 

74 9 T I KEN I KEHIiLDEE E DEE EARRQSREDS DP — EYRASKSKPLTLLEYNLTM 797 

7 65 KILKDKDKHGLVQKFKEKTEIVEKKVQAKVESLQKAVEPHEEQQDLKSKIASKMEVTHVM 824 

798 DTAKLFMSCLHAWGLNEVLDEVCLDRLGMLKPHCTVSFGLLSRGGHMSLMLP GYNQ 853 

825 EVAQLLLSLLHSWGLDPHLDKMCETRLGLLRPIVPISYGVLSKAGYMSLLLPTWQNNYAI 884 

8 5 4 AAGKLLHAKAEVGRKLP AAEGVGKGT YTVSRAVTTQHLLS 1 1 SLANTLMSM 904 

885 PPGIQLPSSSKK-RPLPEELQRLEHLTAVFTSRLHWELSTTLTTNHILALVAMSNTLLSM 943 

905 TNATFIGDHMKKGPTRPPRPGTPDLSKARDSPPASSNIVQGQIKQAAAPWSARSDADHS 964 
94 4 SAAS FL PDS EK — HKKLQR LAQRTDS — TLSNEEE REELMAHHI 983 

965 GS DSAS PAL PTCFLVNEGWSQLAAMHCVMLPD-LLGLE — RFRPPLLEMLARRWQDRCLE 1021 
98 4 SQ 1 KHAWS LLAT HHC FLLP DKI EALEPKKFKRPQVEMMVKRWQHHC I E 1031 

1022 VREAAQALLLAELRRIEQAGRKETI DTWAPYLPQYMDHVI SPGVTAEAMQTMAAAPDASG 1081 
1032 IREAAQQILLGELTRMGKKGRKQLVESWAQYLPLYT-HT-EPIVGAQQQLALISQPASGG 108 9 

1082 PEAKVQEEEHDLVDDDIT AGCLS SVP Q MKKISTSYE 1117 

1090 — AGSGSGGNGGVGVGVSGGGGAGSGSGPGGSVPGGDAHQDEDYEEEEEEIIRKPSSLSE 1147 

1118 ERRKQ AT AI VLLG V I GAE FG AE I — E P P KLLTR PRSSSQIPEGF 1159 

114 8 LKRKQTTAVILLGVIGAEFGQDISQESPNHRGSISMATGANLTSGVAGGERRKSSWEGF 1207 

1160 GLTSGGSNYSLARHTCKALTYLLLQPPSPKLPPHSTIRRTATDLIGRGFTVWEPYMDVSA 1219 
1208 GIAN NLARLTSMALAHLLYAPPSPKLPQYTPLRRAAIDLLGRGFTVWEPYLDVSK 1262 

1220 VLMGLLELCADAEKQLANITMGLPLSPAADSARSARHALSLIATARPPAFITTIAKEVHR 127 9 
1263 VLLGLLE I S CEG- KAVPNLN YKL PLT PQADACRTARHALRL I ATARPAAFI TTMARE VAR 1321 

1280 HTALAANTQSQQS-IHTTTLARAKGEILRVIEILIEKMPTDWDLLVEVMDIIMYCLEGS 1338 
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1322 YNTMQQNAQS INT PLTQS VLHKAKGE I LQC VEML I DKMQS E I AGLL VE VMD I ALHC VDGN 13 8 1 

1339 LVKKKGLQECFPAICRFYMVSYYERSHRIAVGARHGSVALYDIRTGKCQTIHGHKGPITA 1398 
1382 ELKNRGLAELCPAICKFNQISHCAQTRRIAVGANSGNLAIYELRQNKCQMIPAHTHPITS 1441 

5 

1399 VS FAP DG R YL AT YS NT D S H I S FWQMNT S LLG S I GMLNS APQLRC I KT YQ VP P VQ PAS PG S 1458 
1442 LAFSPDGKYLVSYSCAENRLSFWQTSTGMFG LGQS-QTRCTKGYSTAPI PDVS — R 14 94 

1459 HNALKLARLIWTSNRNVILMAHDGKEHRFMV 1489 
10 1495 LNPMRLAKLVWINNRTVTLMLADGSETRFNV 1525 
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The Claimed Invention Is: 

1 . An isolated polypeptide comprising a TRAG polypeptide fragment, said 
fragment comprising a fragment of an amino acid sequence as shown in Table 1, 2, or 3 

2. The TRAG polypeptide fragment of claim 1 , wherein said TRAG polypeptide 
fragment is a full-length TRAG polypeptide comprising an amino acid sequence as 
shown in Table 1, 2, or 3. 

3 . The TRAG polypeptide fragment of claim 1 , further comprising a tyrosine 
phosphorylation motif. 

4. The TRAG polypeptide fragment of claim 1 , further comprising a WD-repeat- 
element motif. 

5. The TRAG polypeptide fragment of claim 1 joined to a detectable label. 

6. The TRAG polypeptide fragment of claim 5, wherein the detectable label includes a 
radioactive isotope, an enzyme, a chromophore, or a mixture thereof. 

7. An isolated nucleic acid sequence encoding a TRAG polypeptide fragment, said 

TRAG 

fragment comprising a fragment of an amino acid sequence as shown in Table 1, 2, or 3 

8. The nucleic acid sequence of claim 7, comprising a nucleotide sequence coding for 
an amino acid sequence as shown in Table 1 , 2, or 3. 

9. The nucleic acid sequence of claim 7, wherein the nucleic acid sequence is codon 
optimized for a specific host cell. 

10. The nucleic acid sequence of claim 7 joined to a detectable label. 

1 1 . The nucleic acid sequence of claim 7, wherein said nucleic acid sequence is DNA. 

12. The nucleic acid sequence of claim 11, wherein the DNA is cDNA. 
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13. The nucleic acid sequence of claim 7, wherein the nucleic acid sequence is RNA. 

14. The nucleic acid sequence of claim 13, wherein the RNA is mRNA. 

1 5. A vector comprising a polynucleotide encoding the TRAG polypeptide fragment of 
claim L 

16. The vector of claim 15, wherein the nucleic acid is operably linked to at least one 
control sequence capable of being recognized by a host cell transformed with the vector. 

17. A host cell comprising the vector of claim 16. 

1 8. A process for producing TRAG polypeptide fragments comprising culturing the host 
cell of claim 17 under conditions such that the TRAG polypeptide fragment is produced. 

19. A TRAG polypeptide fragment produced by the method of claim 18. 

20. A TRAG antisense oligonucleotide comprising a nucleotide sequence that is 
complementary to an mRNA encoding a polypeptide comprising a TRAG polypeptide 
fragment of claim 1 . 

21. A chimeric molecule comprising a TRAG polypeptide fragment of claim 1 fused to a 
heterologous amino acid sequence. 

22. An isolated TRAG specific polypeptide comprising an F a b fragment from an antibody 
capable of specifically binding to a TRAG polypeptide fragment of claim 1. 

23. The isolated TRAG specific polypeptide of claim 22, wherein said polypeptide 
comprises an isolated antibody. 

24. The TRAG specific polypeptide of claim 23, wherein said antibody is a polyclonal, 
monoclonal, or chimeric antibody. 
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25. A method of assaying a sample for a polynucleotide encoding a TRAG polypeptide 
fragment comprising detecting the presence or absence of said polynucleotide in said 
sample utilizing a nucleic acid probe capable of hybridizing with the nucleic acid 
sequence of claim 7. 

26. A method of assaying a sample for a TRAG polypeptide fragment of claim 1 
comprising detecting the presence or absence of said TRAG polypeptide fragment in said 
sample utilizing an isolated TRAG specific polypeptide, said TRAG specific polypeptide 
comprising a F a b fragment from an antibody capable of specifically binding to the TRAG 
polypeptide fragment. 

27. A method of reducing expression of a TRAG polypeptide fragment of claim 1 in a 
cell comprising exposing the cell to an oligonucleotide of at least about 15 nucleotides 
which are complementary to a TRAG mRNA. 

28. A method for producing cell lines having an altered phenotype comprising: 

(i) transfecting in vitro mammalian cells with a DNA vector encoding a TRAG 
polypeptide fragment of claim 1; 

(ii) expressing the TRAG polypeptide fragment in said cells; and 

(iii) selecting for cells having an altered phenotype. 
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